

SEQUENCE LISTING



<110> Stanton, Jr., Vincent



<120> GENE SEQUENCE VARIANCES IN GENES RELATED
TO FOLATE METABOLISM HAVING UTILITY IN DETERMINING THE
TREATMENT OF DISEASE

<130> 11926-015001

<140> 09/658,659
<141> 2000-09-08 

<150> 09/596,033 
<151> 2000-06-15 

<150> 09/357,743 
<151> 1999-07-20 

<150> 09/357,024
<151> 1999-07-19 

<150> 60/093,484 
<151> 1998-07-20 

<160> 16 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 7224 

<212> DNA 

<213> Homo sapiens 



<220>
<221> misc_feature
<222> 194, 3209
<223> n = c or g

<221> misc_feature
<222> 1136, 1334, 3150, 5551, 5934
<223> n = a or g

<221> misc_feature
<222> 284, 1252, 1599, 5573, 5659, 5678, 5874
<223> n = c or t

<221> misc_feature
<222> 3207
<223> n = g or t

<221> misc_feature
<222> 5444
<223> n = c or a



<220> 

<221> misc_feature



<400> 1

aaaggttcta aatgtctgcg gggcccagag ccggacgnca cgccgtcccc czzzgccggz 60

crtctcctgg gnccttttcc gtgccgrccc gcga c:ccgc crctggccgc gcgi:gcct:gg 120

ccgctaggcc gacaccaagg accggccggg zacccgggaa gaaagcacgt gccccagcag 180

ctgccgcgcc cagncccgag agaggcccza gggcgccgcg ggcttncggg gcccgcagcc 240 

cccccgcgac gcgagccaac gggaggcgrc naaagacccg ggcnttgtgt ggcaggct eg 300

cctggcgctg gctggcgtgg cccctggccg zcgzzacczg eggagagcac gr cttctcng 360 

ccgcgccctc tgcgcaagga ggagac~cga caacatgcca cccgcgcucc aagacctgtc 420

gcaacccgaa ggtctgaaga aaaccctgcg ggacgagacc aacgccattc cgcagaagag 480

gantatggtg ctggatggag ggatggggac cacgauccag egggagaage caaacgaaga 540

acacttccga ggtcaggaat ttaaagacca cgccaggccg ctgaaaggca acaatgacat 600

tttaagtata actcagcctg aegtcatena ccaaacccat aaggaatact tgctggccgg 660 

ggcagatatc attgaaacaa ataccttcag eagcaccagt at.tgcccaag ctgactaegg 720

ccttgaacac ttggcctacc ggatgaacat: gtgcwCtgca ggagtggcca gaaaagctgc 780

cgaggaggta actctccaga caggaactaa gaggtntgtg gcaggggctc tgggtccgac 840

taataagaca ctctctgtgt ccccatctgt ggaaaggccg gattatagga acatcacatt 900 

tgatgagctt gttgaagcat accaagagca ggccaaagga cttctggatg gcggggttga 960 

tatcttactc attgaaacta tttttgatac tgccaatgcc aaggcagect tgtttgcacc 1020 

ccaaaatctt tttgaggaga aatacgcccc ccggcccatc tttatttcag ggacgatege 1080 

t:gataaaagc gggeggaetc tctccggaca gacaggagag ggatttgtca t:cagcnt*gt:c 1140 

ccatggagaa ccactctgca ccggatnaaa zzgzgczzzg ggtgcagcrg aaatgagacc 1200 

- tttattgaa ataattggaa aacgcacaac agcc.tzatzgtc czctgztazc enaatgeagg 1260 

tcttcccaac acctttggtg acuatganga aacgccctct atgatggcca agcacctaaa 1320 

ggattttget atgnatggct tggtcaacac agttggagga tgctgtgggt caacaccaga 1380

tcatatcagg gaaattgctg aagctgtgaa aaamgcaag cctagagttc cacctgccac 1440 

tgcttttgaa ggacatatgt cactgrctigg cccagagccc utcaggattg gaccgcacac 1500

caactttgtt aacattggag agegpegraa tgttgcagga.- tcaaggaagt- ttgctaaacu 1560 

catcatggca ggaaactatg aagaagcect gtgtgttgcc aaagtgcagg tggaaatggg 1620 

ageccaggtg ttggatgtca acatggatga eggcatgeta gatggtccaa gcgcaat.gac 1680 

cagattttgc aacttaatng cttccgagcc agacaccgca aaggtacctt tgtgcatcga 1740 

ctcctccaat tttgctgtga ttgaagctgg gttaaagtgc tgccaaggga agtgcattgc 1800

caatagcatt agtctgaagg aaggagagga cgacctcttg gagaaggeca ggaagattaa 1860

aaagtatgga getgetatgg tggtcatggc rtttgacgaa gaaggacagg caacagaaac 1920

agacacaaaa atcagagtgt gcacccgggc ccaccanctg cctgtgaaaa aacegggctzt 1980

taatccaaat gacattattt ttgaccctaa catcccaacc actgggactg gaatggagga 2040 

acacaacttg tatgecacta actttaccca tzgcaacaaaa grcattaaag aaacattacc 2100 

eggagecaga ataagtggag gtctttccaa czzgzcczzc nccttccgag gaanggaagc 2160 

cattcgagaa geaatgeang gggctttccr ttaccatgea accaaguctg gcauggacac 2220 

ggggscagtg aacgctggaa acctcccugt: gtatgacgat acccataagg aactxetgea 2280 

gctctgtgaa gatctcatct ggaataaaga ccctgaggcc actgagaagc tettaegtta 2340 

tgcccagact caaggcacag gagggaagaa agccactcag actgatgagt ggagaaatgg 2400 

ccctgtcgaa gaacgccctg agtatgecer tgtgaagggc attgaaaaac atattattga 2460 

ggatactgag gaagccaggt taaaccaaaa aaaatatccc cgacctctca acataattga 2520 

aggacccctg atgaatggaa tgaaaattgt tggtgatctt tttggagctg gaaaaatgtt 2580

tctacctcag gttataaagt cagcccgggt tatgaagaag gctgttggcc accttatccc 2640 

tttcatggaa aaagaaagag aagaaaccag agtgcttaac ggcacagtag aagaagagga 2700 

cccttaccag ggcaccatcg tgctggccac tgttaaaggc gacgtgcacg acataggcaa 2760

gaacatagtt ggagtagtcc ttggctgcaa taatttccga gttattgatt taggagtcat: 2820

gactccatgt gataagatac tgaaagctge ccttgaccac aaagcagata taattggccc 2880 

gtcaggactc atcactcctt ccccggatga aatgattttt gttgecaagg aaatggagag 2940 

attagctata aggattccat tgttgattgg aggagcaacc acttcaaaaa cscacacagc 3000 

agttaaaata gctccgagat acagtgcacc tgtaatccat gtcctggacg cgtccaagag 3060

tgtggtggtg tgttcccagc tgttagatga aaacctaaag gatgaatact ttgaggaaac 3120

catggaagaa tatgaagata ttagacaggn ccattatgag tctctcaagg agaggagata 3180

cttaccctta agtcaageca gaaaaantng tttccaaatg gattggctgt ccgaaccrca 3240 

cccagtgaag cccacgttta ctgggaccca ggtcrttgaa gactatgacc tgeagaagee 3300 

ggtggactac attgacegga agcctttccc tgacgtctgg cagctccggg gcaagtaccc 3360 



^aaccgaggc :: :t:cccaaga uactcaacga :?,aaacaa:a rracggagagg ccaggaaggt 3420

-racgatgac gcccacaata cgctgaacac accgacragu caaaagaaac nccgggcccg 3480

gggcgtggtt: gggctctggc cagcacagag ~a-ccaagac gacatccacc cgcacgcgga 3540

ggccgccgtg ccccaggccg cagagcccac agccacc::: "atgggccaa ggcaacaggc 3600

zgagaaggac tctgccagca cggagccata craccgccr: -cagacurca ccgccccctt: 3660

gcatccnggc atccgngact acctgggcct gtctgccgtc gcccgccttg gggcagaaga 3720

gccgagcaag gccuatgagg acgacggcga cgaccacagc agcaccatgg ncaaggcgct 3780

gggggaccgg ctggcagagg ccnttgcaga agagcnccac gaaagagccc gccgagaact 3840

gtgggcctac tgtggcagcg agcagctgga cgtcgcagac ccgcgcaggc cgcggtacaa 3900

gggcatccgc ccggctcctg gccaccccag ccagcccgac cacaccgaga agctcaccac 3960

gcggagactt gcagacatcg agc'agtctac aggcartzagg ctaacagaat cattagcaat 4020

ggcacctgcc tcagcagtct caggcctcta cntctccaac ttgaagtcca aatattttgc 4080 

tgtggggaag atttccaagg atcaggttga ggattacgca ccgaggaaga acatatctgt 4140 

ggctgaggtt gagaaatggc ttggacccac tttgggatac gatacagact aacttttttt 4200

ttttttgcct tttttattct tgatgatcct caaggaaaca caacctaggg tgccttaaaa 4260 

ataacaacaa caaaaaacct gtgtgcatct: ggctgacact tccctgcttc tggttttcga 4320 

agactattta gtggaacctt gtagaggagc agggccttcc tgcagtgcct ggaaaacagg 4380 

cgctgctttt ttgggacctt gcgtgaagag cagrgagcag ggttcctgtg gtttccctgg 4440 

tccctctgag atggggacag actgaagaca gaggtcgctt gatttcaaag caagtcaacc 4500 

tgcrtttttc tgtttttaca gtggaatcta ggaggccact tagtcgtctt tttttcctct 4560 

^agaagaaaa g.cctgaaacc gagctgaana gagaagcgng accctgtgac aaaacgatac 4620

-grgagaaac ggggcactnt: aarctaagcg gcc'ataacag uggaccctga cggggaaggt 4680 

gtagctctgt tctcttcgga agaccnegut rccnaaaggc cggaccaaat: ggctgcagaa 4740 

ccccctttgg caaaaggcar gcgctcaccg ctrgcntgtc agaaacactg aagccatttg 4800 

ccccagtgtg gtcaagcagc catgctttcr gggcatuctc gticcccccat aatttcatat 4860 

ttccgnaccc ctgaggaaac aaaaaggaaa tgaggagaga aagttactgt taagggtggt 4920

taacattttt tttgttttgt tttg^ttttgg cttttttttt tttgagacag agtctggctc 4980

cgtcgcccag gctggagtgc aggggcgcaa ccrcggccca tagcaagctc cgcctcctgg 5040 

gttcatgcca ttctcctgcc tcagcctcca gagtagccgg gactacaggt gcccgccacc 5100 

acacccggct aattttttgt gtttttacaa aatacaaaaa agtagagaca ggatttcact 5160 

gtgttagcca ggatggtctst gatctcccga ccticgtigatc tgcccacctc agcctcccaa 5220 

aatgctggga ttacaggcgt gagccaccga gcctggccgg ttaacatctt ttaattgttt 5280 

ccaggattga gcaggttctc agctgggccc tgatatcccg tgcggagttg gacaagtggg 5340 

cagcataaag tcactcattt cttaccattt cattcccctc aattctcaat atattcagra 5400 

atgaagaatg gtgccaccac tcaagcaaca agcctcaaac tcanccatgt catctttttc 5460 

ctggacgatt gcagttattt caaaaatttg catgcaaaat atacactcat cctacttcaa 5520 

gatggtggng gcaatagtca ggagaaggca ncattggagt cctggcttga ctngaaggat 5580

gaagacgaag aagcaaggga ggaacaaacg aagaaccatc cttgttcatg aataggaata 5640 

ctcaagacta taaaggnanc aggcctccca aaattganct atggatxtaa caccattttc 5700 

aatggaaatt ccaacagatt ttattgaatg aaacaagcag gtgtttatat ggagtagcaa 5760 

aggacttaaa attaccaaat gcttctaaat atgaaggaga ggttggggac acgcacccra 5820

tgtgatacca agttttattg tcaagacagt gtcatggtgc agaggtaggc attntgagca 5880

ggggaacaaa ataagggcct agaaacccac ccgtgcatat gttgaccttt gcanaatgac 5940 

ctggtgacat ggcaagtcag tggggacagg aaggaccact ccctaagtaa tcccagaaca 6000 

atggctattc atgtgggaaa aaaagaaatt ttactttctc tcaccttacc tggtgataag 6060 

ttccaaatat gttaagggct ttaatacaaa aagcaaaaat tgtcagtgtt tggatgaaaa 6120 

aagccttagg gcaggaaaga atctcttgag acataaagta gtaatcataa aggacaagat 6180

ggttaagtca attctgttaa aactcaaggc ttatattaag caaacacttg aagtgagaag 6240

atgatccaca acttgagaag acatttataa tacaaataac tgatgaagga ttcacaatca 6300

caaatataga gaattcctat ttaaaaaaat agaaaaatag tgaagactac acaagaggaa 6360

atagggcttt taaataaaca gatgttctgr agcattggtc agggaaatat gaattaggac 6420 

cacaatgaga ttccatttta tatccataag atttgcaaag gttgggtctg acagtaccag 6480 

ttgttagatc tgtagggact tgtacaacat tgtggatgtg taaacaggca ccactgcttt 6540

aaaaaacaat tatcccttac agacttgaac atttgcagac cttatgatct tgcttccaac 6600

tcccacctgt atgtccagca aactcttgca tgtggccact aggaggaacg tgtaagaatg 6660 

ttcacagtta catatttata atagttaaca actggaaaaa gtgaaatgta tgtctgtcta 6720

caggaaaata ggtgaataat tagatatatg tattcattct acgggatatt attcagcagt 6780

j ^3.3a Z gag Z gaaccacagc :a:acc:cac aacasgaazg adcac" aagg 6840

aaaaaagcaa gcttgaagag accacatggg gcgcaccanc rartgagc ccaaaaacaa 6900

gcaaaac ca.a agaacatgza gcctaagcac acgtatacaa ~aaaacca~g " z a zz aaaa a 6960

aaaaggtaac :gataaacca aaattgagca tiagtaancac aggaagtgga 7020

agggacagga gcacataggc agatgccaag ctacgcagcc -^ccnggcag 7080

gcttacaagt: gtttactaca cgctattaac acattacacc itscaaCwSa cagacaacag 7140

tcuntcacat attaaatatg ctctacccaa acatactaca aaaaacaaag gcaaagtgga 7200

atgataaccx aaaaaaaaaa aaaa 7224



<210> 2
<211> 6972
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 3434, 4313, 5255, 5507, 5810, 6128, 6526, 6636
<223> n = c or t

<221> misc_feature
<222> 4799, 5455
<223> n = a or g



<400> 2 

cgcccccgcc tctgagcccc cttcccatgg cggccccagt gtzggaggac gggtcggucc 60 

tgcggggcca gccctttggg gccgccgtgt egacngccgg ggaagnggcg uctcaaaccg 120 

gcatggtcgg ctaccccgag gccc£cactg atcccuccta caaggcacag atcttagtgc 180 

tcacctatcc tctgatcggc aactatggca tccccccaga cgaaatggat gagttcggtc 240 

tctgcaagtg gtttgaaccc tcgggcatcc acgtagcagc accggtagtg ggagagcgcc 300 

gtcctactcc cagccactgg agtgccaccc gcaccccgca tgagcggccg cagcagcatg 360

gcatccccgg cttgcaagga gtagacacuc gggagct:gac caagaagccg cgggaacagg 420 

ggtctctgct ggggaagctg gtccagaatg gaacagaacc ttcatccctg ccattcttgg 480 

accccaatgc ccgccccctg gcaccagagg tctccactaa gaccccacgg gtattcaata 540 

cagggggtgc ccctcggate cttgctttgg actgtggcct caagtataac cagatccgar 600 

gcctctgcca gcgtggggct gaggtcactg tggtaccctg ggaccatgca ctagacagcc 660 

aagagcatga gggtctcctc ctaagtaatg ggcctggcga cccrgccccc taccccagcg 720 

ticgtatccac actgagccgc gttttatctg agcctaaccc ccgacccgcc tttgggacct: 780 

gcctgggaca ccagctaccg gccttagcca ttggggccaa' gacntacaag acgagatang 840

ggaaccgagg ccataaccag ccctgcxtgt; cggngggctc cgggcgccgc cctctgacat 900

cccagaacca tgggtttgct gtggagacag actcactgcc agcagactgg gctcccctct 960 

tcaccaacgc caatgatggt tccaatgaag gcattgtgca caacagcntg cctttcttca 1020 

gtgtccagtt tcacccagag caccaagctg gcccttcaga tatggaactg cntttcgata 1080 

tctttctgga aactgtgaaa gaggccacag ccgggaaccc tgggggccag acagctagag 1140 

agcggctgac tgagcgcctc tgtccccctg ggattcccac tcccggccct ggacttccac 1200 

caccacgaaa ggttctgatc ctgggctcag ggggcctctc cattggccaa gctggagaat 1260 

ctgactactc gggctctcag gcaattaagg ccctgaagga ggaaaacatc cagacgttgc 1320 

tgatcaaccc caatattgcc acagtgcaga cctcccaggg gctggccgac aaggtctatt 1380

ttcttcccat aacacctcat tatgtaaccc aggtgatacg taatgaacgc cccgatggtg 1440 

tgttactgac ttttgggggc cagac.tgctc tgaaccgtgg tgtggagctg accaaggccg 1500 

gggtgctggc tcggtatggg gtccgggtcc tgggcac'aac agtggagacc attgagctga 1560 

ccgaggatcg acgggccctt gctgccagaa tggcagagat cggagagcat gtggccccga 1620 

gcgaggcagg aaattctctt gaacaggccc aggcagccgc tgaacggctg gggtaccctg 1680 

tgctagtgcg tgcagccctt gccgtgggtg gcctgggctc tggctttgcc T:ctaacaggg 1740 

aggagctctc tgctctcgtg gccccagctt ttgcccatac cagccaagtg ctagtagaca 1800 

agtctctgaa gggatggaag gagattgagt acgaggtggt gagagacgcc tatggcaact 1860 

gtgtcacggt gtgtaacacg gagaacttgg acccactggg catccacact ggtgagtcca 1920 

tagtggtggc ccctagccag acactgaatg acagggagta tcagctccng aggcagacag 1980 



^atcaaggt gacccagcac cr.gggaaccg czggggagcg caacgcgcag zazgccccga 2040

accccgagtic tgagcagcat tacaccactg aagngaacgc caggccctct: cgcagcccrg 2100 

ccctggccag taaggccaca ggncat ccac tggctizatgt ggcagccaag c-agcatcgg 2160 

gcatcccucc gcctgagcsc aggaaccccg cgacaggggg cacagcagcc :~:gaaccca 2220

gcgtggacca ccgtgcggng aagactcccc gangggaccr -agcaagttc -cgcgaguca 2280

gcacaaagat tgggagcngc acgaagagcg ctggrgaagt: catgggcact: gggcgntcar 2340

ttgaggaggc cntccagaag gccctgcgca cgguggacga gaaccgcgcg ggcc^ngacc 2400

acacagtgaa accagtcagc gatatggagt tggagactcc aacagacaag cggacccczg 2460 

cggcggcagc tgctttgngg gcnggncant eagtggaccg cccgcacgag c::acacgcs 2520 

tcgaccgcug gtccccgcac cgaatgaagc gtatcatcgc acacgcccag czgcuagaac 2580 

aacaccgcgg acagccnttg ccgccagacc cgccgcaaca ggccaagcgt crcggccrrc 2640 

cagacaaaca gattgccctt gcagtcctga gcacagagct ggctgttcgc aagctgcgcc 2700 

aggaactggg gatctgccca gcagtgaaac agattgacac agttgcagct gagcggccag 2760

cccagacaaa ttacccatac ctaacgtatt ggggcaccac ccacgacctc acctuncgaa 2820

cacctcatgt cctagtcctt ggctctzggcg tctaccgtat tggcnccagt g~tgagtttg 2880

actggtgtgc tgtaggctgc atccagcagc tccgaaagat ggganataag accatcatgg 2940

tgaaccataa cccagagaca gtcagcaccg actatgacat gtgtgatcga ctctactttg 3000

atgagatctc ttttgaggtg gtgatggaca tctatgagct cgagaaccct gaaggtgtga 3060

tcctatccat gggtggacag ctgcccaaca acatggccat ggcgutgcac cggcagcagt: 3120

gccgggtgcc gggcaccccc ccrgaagcca ttgacccggc tgagaaccgn tzcaagtttit: 3180

cccggcccct tgacaccatt; ggcatcagcc agccccagcg gagggagcxc agcgaccccg 3240

agcccgctcg ccaantcrgc cagaccgtgg ggracccctg cgtgg::gcgc ccrcrccuacg 3300

tgctgagcgg tgctgctatzg aangtggcct: acgcggatgg agac.ccggag. cgcttcctga 3360

gcagcgcagc agccgtctcc aaagagcatc ccgnggtcat ctccaagttc acccaggagg 3420 

ctaaggagat tgangtggat gccgcggcct ccgatggtgt ggtggcagcc atcgccaccc 3480 

ctgagcatgt ggagaatgca ggcgtgcatt caggcgatgc gacgcrggtg acccccccac 3540

aagatatcac tgccaaaacc ctggagcgga tcaaagccat tgtgcatgct gtgggccagg 3600

agctacaggt cacaggaccc ttcaatctgc agctcattgc caaggatgac cagctgaaag 3660

ttattgaatg caacgtacgt gcctctcgct ccttcccctt cgtttccaag acactgggcg 3720

tggacctagt agccctggcc acgcgggtca tcatggggga agaagtggaa cctgtggggc 3780

taatgactgg ttctggagtc gtgggagtaa aggcgcctca gttctccttc tcccgcttgg 3840

cgggtgctga cgtggtgttzg ggtgcggaaa tgaccagtac tggggaggtg gccggctttg 3900

gggagagccg ctgtgaggca tacctcaagg ccatgctaag cactggctt't aagatcccca 3960

agaagaatat cctgctgacc attggcagcc ataagaacaa aagcgagcxg cccccaactg 4020

tgcggctact ggagagcccg ggctacagcc tctatgccag tctcggcaca gctzgacT:tct 4080

acactgagca tggcgtcaag gtaacagctg tggaccggca ctttgaggag gccgtggatg 4140

gtgagtgccc accacagcgg agcatcccgg agcagctagc cgagaaaaac zrtgagccgg 4200

ngartaacct gtcaatgcgn ggagctgggg gccggcgtct ctcczccztt grcaccaagg 4260 

gctaccgcac ccgacgcctg gccgccgact tctccgtgcc ccnaaccanc ganaccaagc 4320 

gcaccaaact ctttgtggag gccctaggcc agatcgggcc agcccctcct ttgaaggcgc 4380 

atgttgactg tatgacctcc caaaagcttg tgcgactgcc gggattgatt gatgtccatg 4440 

tgcacctgcg ggaaccaggt gggacacata aggaggactt tgcctcaggc acagccgctg 4500 

ccctggctgg gggtatcacc atggtgtgtg ccatgcctaa tacccggccc cccatcattg 4560 

acggccctgc tctggccctg gcccagaagc- tggcagaggc tggcgcccgg tgcgacnttg 4620 

cgctattcct tggggcctcg tctgaaaatg caggaacct-t gggcaccgtg gccgggtctg 4680 

cagccgggct gaagctttac ctcaatgaga ccttctctga gctgcggctg gacagcgtgg 4740 

tccagtggat: ggagcatttc gagacatggc cctcccacct ccccattgtg gctcacgcng 4800 

agcagcaaac cgtggctgct gtcctcatgg tggctcagct cactcagcgc tcagtgcaca 4860 

tatgccacgt ggcacggaag gaggagatcc tgctaattaa agccgcaaag gcacggggct 4920 

tgccagtgac ctgcgaggtg gctccccacc acctgttcct: aagcc'atgat gacctggagc 4980 

gcctggggcc tgggaagggg gaggcccggc ctgagcttgg ctcccgccag gatgtggaag 5040

ccctgtggga ggacatggct gtcatcgact gctttgcctc agaccatgct ccccacacct 5100 

tggaggagaa gcgtgggtcc aggcccccac ctgggttccc agggrtagag accatgctgc 5160 

cactactcct gacggctgta agcgagggcc ggctcagcct ggacgacctg ctgcagcgac 5220 

tgcaccacaa tcctcggcgc atctttcacc tgccnccgca ggaggacacc tatgtggagg 5280 

tggatctgga gcatgagtgg acaattccca gccacatgcc cttctccaag gcccactgga 5340 

caccttttga agggcagaaa gtgaagggca ccgtccgccg tgtggccctg cgaggggagg 5400 



rzgcczanat cgacgggcag gccccggtac :• rcrcgggcta - ggacaggac gnacngaagc 5460

ggccacaggg ggctgccccc cagcccccac cczcagcccc ngccacnagt gagangacca 5520

cgacacctga aagaccccgc cgcggcatcc cagggcc*:cc cgatggccgc ctccaccrgc 5580

cgccccgaac ccancgagcc ^ccgacccag g-tcgccagc cgaggagcca aaggagaagu 5640

cctctcggaa ggtagccgag ccagagccga t:gggaacccc cgatggcacc cgctaccc:c 5700

caccaccagt accgagacag gcacctcccc agaacecggg gacccctggc ttgccgcacc 5760

cccagacctc accccngcug cacccattag tgggccaaca uaccctgtcn gtccagcagc 5820

ccaccaagga tcaga.tgtct cacccgctca atgtggcaca cacactgcgc acgatggcgc 5880

agaaggagcg gagcctcgac acccrgaagg ggaaggccac ggcctccacg ctctacgaag 5940

tgagcacacg gaccagcagc tcctttgcag cagccacggc ccggctggga ggtgcxgcgc 6000

tcagcttctc ggaagccaca tcgtccgtcc agaagggcga auccctggcc gaccccgngc 6060

agaccatgag ctgctatgcc gacgncgtcg cgccccggca cccccagcct ggagcagngg 6120

agctggcngc caagcactgc cggaggccag tgaccaatgc cggggatggg gccggagagc 6180

accccaccca ggccctgctzg gacaccttca ccatccgcga ggagctggga actgtcaacg 6240 

gcatgacgat cacgatggtg ggtgacctga agcacggacg cacagtacac tccctggcct 6300

gcctgctcac ccagtatcgt gtcagcctgc gctacgtggc acctcccagc ctgcgcatgc 6360

cacccactgt gcgggccttc gtggcctccc gcggcaccaa gcaggaggaa ttcgagagca 6420

ttgaggaggc gctgcctgac actgatgtgc tctacangac tcgaatccag aaggaacgat 6480

ttggctctac ccaggagcac gaagcttgct etggtcagct catcctcacc ccccacatca 6540

tgacccgggc caagaagaag atggcggcga cgcacccgat gccccgcgtc aacgagacaa 6600

gcgcggaagt ggacncggau ccccgngcag ccnacunccg ccaggcngag aacggcacgt 6660

acatccgcat ggcuccgcta gccacngcgc cgggccgcct: ccaggggccr: ggcctccrca 6720

gcctcttctc tttaggceca gcngccgggc aaggaatrcc agtgcccccc acgggggcag 6780

cacacttaga cattcccgga catccagatt gcccacatgt- gctgaccaca cttcaggccc 6840

cggactggag ctctctggca tgggggtggg gcctcagacg ccggggccca gtctgcccca 6900

■tctt'cattcc tgcaccttaa acctgcacag ccantnrtct actgacxtaa caaacagccg 6960

agctgtcccc tg 6972



<210> 3 
<211> 3951 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 156, 3432, 3682, 3937
<223> n = t or c

<221> misc_feature
<222> 577, 638, 1708, 3730, 3925
<223> n = a or g



<400> 3 

gctgtcactt ggctctctgg ctggagcttg aggacgcaag gagggtttgt cactggcaga 60 

ctcgagactg taggcactgc catsggcccct gtgcccagta aggactcggc ggacatcgag 120

agtatcctgg ctttaaatcc tcgaacacaa actcatgcaa ctctgngttc cacttcggcc 180

aagaaattag acaagaaaca ttggaaaaga aatcctgata agaactgctt taattgtgag 240 

aagctggaga ataattttga tgacatcaag cacacgactc ttggtgagcg aggagctctc 300 

cgagaagcaa tgagatgcct gaaatgtgca gatgccccgt gtcagaagag ctgtccaact 360

aatcttgata ttaaatcatt catcacaagt accgcaaaca agaactatta tggagctgct 420 

aagatgatat: tttctgacaa cccacttggt ccgacttgtg gaatggtatg tccaaccrcn 480 

gatctatgtg taggtggatg caatttatat gccactgaag agggacccat taatattggt 540

ggattgcagc aatttgctac tgaggtattc aaagcantga gtatcccaca gatcagaaat 600 

ccttcgctgc ctcccccaga aaaaatgtct gaagcctntt ctgcaaagat tgctcttttt 660 

ggtgctgggc ctgcaagtat aagttgtgct tcctttttgg ctcgattggg gtactctgac 720

atcactatat ttgaaaaaca agaatatgtt ggtggtttaa gtacttctga aattccccag 780

ttccggctgc cgtatgatgt agtgaacttt gagattgagc taatgaagga ccttggtgna 840

aagacaacct gcggcaaaag czzzzcaqza na-gaaa^ga ::::i:agcac -zcgaaagaa 900

aaaggccaca aagctgcntt caccggaaca ggunrgccag sacccaacaa agacgccacc 960

~t:ccaaggcc tgacgcagga ccaggggtrt :acaca:cca aagactuttr gccactxgra 1020 

gccaaaggca gtaaagcagg aacgtgcgcc -gccacccc: cacngccauc gacacgggga 1080 

gtcgtgactg cacttggagc cggagacact: gcccccgact: g*:gcaacacc cgccccacgc 1140 

cgtggagctc gccgagtgtt caccgtcrrc agaaaaggct ttgttaacat aagagccgcc 1200 

cccgaggaga tggagcttgc caaggaagaa aagcgngaac ztctgccatt cccgtcccca 1260 

cggaaggtta tagtaaaagg tgggagaacc gctgcracgc agtttgttcg gacagagcaa 1320 

gatgaaactg gaaaatggaa tgaagacgaa gaccagangg tccatccgaa agccgatgtg 1380

gccatcagtg cctttggtcc age tec gage gaucccaaag caaaagaagc cttgagcccn 1440 

ataaaactta acagatgggg tctcccagaa gtagacccag aaactatgca aactagtgaa 1500 

gcacgggtat ttgcaggtgg tgacgtcgct ggcttggcta acactacagt ggaatcggcg 1560 

aatgatggaa agcaagcttc ttggtacatt cacaaacacg tacagccaca atatggagcr 1620 

tccguttctg ccaagcctga actacccctc tttcacactc crattgatct ggtggacatt 1680 

agtgcagaaa tggccggatt gaagtttnca aatccctttg gtcttgctag cgcaactcca 1740 

gccaccagca catcaatgat tcgaagagct tttgaagcng gacggggttt tgccctcacc 1800 

aaaactttct ctcttgataa ggacautgtg acaaacgttc cccccagaat catccgggga 1860 

accacctctg gccccatgta tggccctgga caaagcrcct: ttctgaatat tgagctcatc 1920 

agtgagaaaa cggctgcata ctggcgtcaa agtgccactg aactaaaggc tgacttccca 1980 

gacaacattg tgattgctag cacratgcgc agttacaaca aaaatgactg gacggaact^ 2040 

gccaagaagc ctgaggattc cggagcagan gcccrggagt tiaaacttatc acgtccacac 2100 

ggcargggag aaagaggaat gggcccggcc -gtgggcagg acccagagct ggngcggaac 2160

auctgccgct: gggttaggca agctgttcag actcc:::t: ccgccaagct gaccccaaac 2220 

gccac.tgata ttgtgagcac cgcaagagct: gcaaaggaag gtggtgccaa tggcgttaca 2280 

gccaccaaca ctgtctcagg cctgatggga ccaaaacctg atggcacacc ttggccagca 2340

gtggggattg caaagcgaac nacanatgga ggagtg^ctg ggacagcaat cagacctatc 2400 

gctttgagag ctgtgacctc catt^gctcgt gctctgcctg gatttcccat tttggctacc 2460 

ggtggaattg actctgctga aagtggtctt caguttctcc atagtggtgc ttccgtcctc 2520 

caggtatgca gtgccattca gaatcaggac ttcactgtga ccgaagacta ctgcactggc 2580

ctcaaagccc tgctttatct gaaaagcant gaagaactac aagactggga tg.gacagagt 2640

ccagctactg tgagtcacca gaaagggaaa ccagttccac gtatagctga actcatggac 2700

aagaaactgc caagttttgg accttatctg gaacagcgca agaaaatcat agcagaaaac 2760

aagattagac tgaaagaaca aaatgtagcc ttttcaccac ttaagagaag ctgttttatc 2820 

cccaaaaggc ctattcctac catcaaggat gtaataggaa aagcactgca gtaccttgga 2880 

acatttggtg aattgagcaa cgtagagcaa gutgtggcta tgattgacga agaaatgtgt 2940 

atcaactgtg gtaaatgcta catgacctgt aatgactctg gctaccaggc tatacagrtt: 3000

gacccagaaa cccacccgcc caccacaacc gacacttgta caggctgtac tctgtgtccc 3060 

agtgtttgcc ctattgtcga ctgcatcaaa acggtttcca ggacaacacc ctatgaacca 3120 

aagagaggcg tacccttatc tgtgaatccg gtgtgctaag gtgatttgtg aaacagttgc 3180 

tgtgaacttt catgtcacct acatatgctg acctcncaaa atcatgatcc ttgtgttcag 3240

ctctttccaa attaaaacaa atatacactt tctaaacaaa aatatgtaat ttcaaaatac 3300

atttgtaagt gtaaaaaatg tzctcatgtca atgaccattc aattagtggc ataaaataga 3360 

ataattcttt tctgaggata gtagttaaat aacxgtgtgg cagttaattg gatgttcacc 3420 

gccagttgtc tnatgtgaaa aattaacttt ttgtgtggca attagtgtga cagtttccaa 3480 

attgccctat gctgtgctcc atatttgatt tctaattgta agtgaaacta agcattttga 3540

aacaaagtac tctttaacat acaagaaaat gtatccaagg aaacatttta ccaataaaaa 3600

ttacctttaa ttttaatgct gtttctaaga aaatgtagtt agctccataa agtacaaatg 3660

aagaaagtca aaaattattt gntatggcag gataagaaag cctaaaattg agtttgtgga 3720

ctttattaan taaaat cccc ttcgctgaaa ttgctcattt ttggcgttgg atagaggata 3780

gggagaatat ttactaacca aataccattc actactcatg cgtgagatgg gtgtacaaac 3840 

tcatcctctt ttaatggcat ttctctttaa accacgttcc taaccaaatg agatgatagg 3900 

atagatcctg gttaccactc ttttnctgtg cacatanggg ccccggaatt c 3951 



<210> 4 

<211> 2816 

<212> DNA 

<213> Homo sapiens 





<220>
<221> misc_feature
<222> 175, 1067
<223> n = g or a

<221> misc_feature
<222> 341
<223> n = c or g

<221> misc_feature
<222> 791, 1997, 2618, 2653
<223> n = t or c

<221> misc_feature
<222> 1337
<223> n = c or a

<221> misc_feature
<222> 2107
<223> nucleotide in position 2107 is g or absent

<221> misc_feature
<222> 2583
<223> n = t or g

<400> 4 

gggccgggtc cgggagcccc agggcagccg ccccgccgag ncgcaggcac agtgtcacct 60

tcgtcccctc cggagctgca cgtggcctga gcaggatggt gccctccagc ccagcggtgg 120 

agaagcaggt gcccgtggaa cctgggcctg accccgagct ccggtcctgg cggcncctcg 180 

tgtgctacct ttgcttctac ggcttcatgg cgcagatacg gccaggggag agcttcatca 240 

ccccctacct cctggggccc gacaagaact tcacgcggga gcaggtcacg aacgagatca 300

cgccggtgct gtcgtactcc tacctggccg tgctggtgcc ngtgttcctg ctcaccgact 360 

acctgcgcta cacgccggtg ctgctgctgc aggggctcag cttcgtgtcg gtgtggctgc 420 

tgctgctgct gggcca.ctcg gtggcgcaca tgcagctcat ggagctcttc tacagcgtca 480 
ccatggccgc gcgcatcgcc tattcctcct acatcttctc tctcgtgcgg cccgcgcgct 540

accagcgtgt ggccggctao tcgcgcgctg cggtgccgcc gggcgtgttc accagctccg 600

cgctgggcca gctgctggtc actgtgggcc gagtctcctt ctccacgctc aactacatct 660

cgctggcctt cctcaccttc agcgtggtcc tcgccctctt cctgaagcgc cccaagcgca 720 

gcctcttctt caaccgcgac gaccgggggc ggtgcgaaac ctcggcxtcg gagctggagc 780 

gcatgaatcc nggc.ccaggc gggaagctgg gacacgccct gcgggtggcc tgtggggact 840

cagtgctggc gcggatgctg cgggagctgg gggacagcct gcggcggccg cagctgcgcc 900 

tgtggtccct ctggtgggtc ttcaactcgg ccggctacta cctggtggtc tactacgtgc 960 

acatcctgtg gaacgaggtg gaccccacca ccaacagtgc gcgggtctac aacggcgcgg 1020 

cagatgctgc ctccacgctg ctgggcgcca tcacgtcctt cgccgcnggc ttcgtgaaga 1080 

tccgctgggc gcgctggtcc aagctgctca tcgcgggcgt cacggccacg caggcggggc 1140 

tggtcttcct tctggcgcac acgcgccacc cgagcagcat ctggctgtgc tatgcggcct 1200

tcgtgctgtt ccgcggctcc taccagttcc tcgtgcccat cgccaccttt cagattgcat 1260 

cttctctgtc taaagagctc tgtgccctgg tcttcggggt caacacgttc tttgccacca 1320 

tcgtcaagac catcatnact ttcattgtct cggacgtgcg gggcctgggc ctcccggtcc 1380 

gcaagcagtt ccagttatac tccgtgtact tcctgatcct gtccatcatc tacttcttgg 1440 

gggccatgct ggatggcctg cggcactgcc agcggggcca ccacccgcgg cagcccccgg 1500 

cccagggcct gaggag.tgcc gcggaggaga aggcagcaca ggcactgagc gtgcaggaca 1560 

agggcctcgg aggcctgcag ccagcccaga gcccgccgct ttccccagaa gacagcctgg 1620

gggctgtggg gccagcctcc ctggagcaga gacagagcga cccatacctg gcccaggccc 1680 

cggccccgca ggcagctgaa ttcctgagcc cagtgacaac cccttccccc tgcactctgt 1740 

gctccgccca agcctcaggc cctgaggctg cagatgagac ttgtccccag ctggctgtcc 1800 









-.zcczzczgg cgtcagcaag ccgggcccgc a.gzgzczzcz aagcgacgg- z'Z zca^raaccr 1860

-gaaccagug acccccgggc gcccctgcgg caactttgca S9 C ^ CCCZ - agcgca:ccc 1920

cacgaccccc gcctcgaggg cc^cczgccz zagcaacggg i sccct gcca 1980

gcaggccccc caggatnccc cctgccctgt gccgcacccu ggcggcggcc acagcg-gct 2040

ggcgacaccc agggcagccg cctggccatg crgcccccgc accgtgcccc gcgggczzrg 2100

rtgctgngaa gaggtgggug gtigggcctct gcgrccacca ggccccaccg gcccacgccc 2160

cttggggggc ntgagacaaa ccctttctgc cccccagggc cagcgaagcg gcctc"gga 2220

caccagctca ggggacaccg gccccacagg agctgtgagc ccncnagggc =gggcgggag 2280

cegggaccct caggtgtagc cgagccgcga cactgccgg't cacccnrggc gcccccgcct 2340

ttttgaaaga tgctttrcnt ctttttaact gacgtagaat gaagaaccgc a ~ g c gg czzc 2400

ccugtccctg cggaaaagcc atctcaggct ggcggcagac acactgccac cagaggggag 2460

cagcggctct ggtcctcgga gctggctcct ctctcccacc cnaagggcag -cccccaugg 2520

ccctgtctgt ccttctgaag cgtgtccatc ctgacctgcg ggccctcagc :gcccccaca 2580

ctngtgccag cccggagggg actggtcccg gtcaccgngg acgtgctggc crznggtangt 2640

gccaggcttg ccngggctgg gcagccttgg gggggctgcc tttgtggtgg gcgctgggga 2700

agtacgtccc agcggcccca gggcctaagg agcgctagtg ccttgcccac aggngcggga 2760

ccacctgatg tgatgtgaat actcttccca catacattaa acacacctaa gugaga 2816



<210> 5 
<211> 3772 
<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature
<222> 431, 441
<223> n = a or g

<221> misc_feature
<222> 498
<223> n = c or t

<221> misc_feature
<222> 579, 599
<223> n = g or c



<400> 5 

gancccccat ttccagccaa caaarccttt ttaagctcct 1 1- gaga i; teg zcacgcgiigc 60

zcgccacacc caggactctg gaaagaagcc caggccagag czrtgggcag gcggccantt 120

aggcaagggc cctgtgttgg cttcctggtg gggttgccct gctggtgggc gggagaccaa 180

gagcaccccc gcaacaccag gaggcaggtc gcggattgtg ctgtctacac cccggaaggg 240

gtacattcca ggctgccgcc ccagactcac ccctcgcccg ggacccgcac tcttgagctg 300 

tgggtaccac ggtggccgtc cccttctgtt ctgtgcagtg gacttcctgg ctcctcctta 360 

gccttggggc cccacagccc tcggcttggc ttccctcccc atagccaggc cctgggtaac 420 

tccaggggaa ngtgaccctg nggcccccca cttctccccg tgctcctgca caggccrtgg 480 

gctttcggcg gtgctgtntg ccgcagcccc acgccttcct gggagagtgg cccaggcccc 540

ccttcctgag tgtgactgcg ctgccgtctg cgaggcctnc gcgggtctcc cccgggctnt 600 

cctgctggga tggggactgg tggccccggg ccacgtcctg gatccggctt gctccttggt 660 

acaagccgta cgggtcacgg tcaggcagga gggcgggcgg cggggtcccg ggggcgccga 720 

gctcggggcg tgcggtcccc aagagcaggc tgtgcgtgtc cccgttggag ccccacgaag 780 

gcggcccagg gcacccctga gggcgcgtgg gccgacccgc gtcccggatc cagcttzgcgc 840 

caggaatgca ggtgttccag ggtgccaaaa ggaaaacgca caaggcctcg ccgaggaggg 900 

ggggtcagga ggggaccggg ggtgggaaga acgcggggga gagggacggc agggtgcccg 960 

cccgagggac cgacaccrcc gcgagtggca ccccaggatg ctgacgccgg cgggggtggg 1020 

ggcccgaggg gcggtcgggg tcagggggcg gccccagggg tagggccgca gcacgagggg 1080 

ccgcgtgacc cggcggtgac cgggtgggga gaggccggcg ccggggctgg gagacggccg 1140 

cgggcgggag ggtgccccgt ggggacgctc ccgccgcagc gcccggccac gcgcgaggcc 1200 




ccgccctcag gacgcgt~cg cgggacggac ccgcccaccc cgcagccgcc ggcccgccgc 1260

gcgccttgtg ggcgcxgtag -crccggag" cgcgcgcgcg gggccgggtc cgggagcccc 1320

agggcagccg ccccgccgag ccgcaggcac cggcggggaa cggggccacg gggcgcgngc 1380

^gggggctgc ggggtgnctc ggggccccgg ggtgagngcg gggcgcgggc cgaggctcgc 1440 

agggccctgt gaggtgagcg ngggggcugg cgccggggrc cgcggggccc cggggagggr 1500 

gcggggcgtg ggccggggcc cgcggtctgc agccrggggt ccgcggggcc -ggggaggg- 1560 

gcggggcgng gccggggucr gcggtctgca gcccggggtc tggggggccc tggggagggn 1620 

gcggggcgug ggccggggnc icgcgggggu crgcggcggcc cgggggcctg gcagaaccgc 1680

cgctgtgcac ggggtttccc gccgczcgcz ^tccgccgca gcctgcgaan ggggtgggga 1740

gtcccgggcc ccagcc'tgcc ccccgcgnca ccctggggcg ccaagtccca cccccgggrc 1800

tggaggaaag cgtggacccg cgttcgcgcc caggcacgtg ttgcttcggg acgggccagc 1860

c^gtgggtga accctgccag ccacgcgngg ggcgggcccc tggcacatct ccagaccacu 1920

gtctcctgtg ccagaagctt tgtaggcgca acttcccct't ggagcagctg tgggtgcgga 1980

tccagcggac gaatcccgag gcgtctcaga gagagcctgg acagccgcxg gagcctcrce 2040

cgagcgggtc cttccaacac cgccacagca ggaaagccat ccccctaggg tcctgtccan 2100 

cggaaacccc tgtcctgggg agtctgcccg cctggcctca ggacacaggc caactaagcc 2160 

ggccccgaaa tccagaatgc atccagaggg aaggcgggat aaagtccttg gagcgcctgc 2220

tggccgcccc gtaaagaggt ggcctccccc tacggagacc cgaggatccc cgcacagccc 2280

agattcaatc agcagagccg aggtgcctct ggcccagtgc acctgccngc cctgtccagg 2340

cctgggagcc aggctgcanc tcacnggccg cctttgcctg ggtgccacct gngcactgcc 2400 

cgttgcaact gctaactgci: ctctttccga agggcrrrgg aggattttta taactccaga 2460 

tagcacagcc atctcngctg gacacagacg agaaagagtg cttctcgggc g^ttgggccc 2520

gcagcagcga tagccggagg tccaattatg ctgttaggaa ccctgaactc ggtcatctga 2580

acaggggcgg gagggtgtgc aatgcutccn tcttcttctt cttcttttta aactagcagg 2640

cgttctaaaa aacataacga acattctrgg ttagccntcc agagtaggag ctggtttaaa 2700 

cacggaacga taggtggcgr ttgcttgngt tttgactgcg ggtctctggc cntctctggc 2760

gcttggaagg acagggccng ggtggggctg g.tcaccgtgg acagtggggc cggggacttg 2820 

caggggctgt tacaaccttc tcctgaaggc agggattctc tctgcttccc cgtggcccrc 2880 

ctgtctggtc ggggact tec ttcagatgee gggaagaggc ctcaagctgt atgggactgg 2940

gctggggtct ggacactngg agtctaggcg tcccccggct tggggctgcg tttctatgac 3000

ggtgaccaag ttccctatct ttcctcttgg aggtggtctg ggccgtgatg gccaagcctc 3060 

tgtcagtggg ctacgttcac ggcacataag ttgagtatgc eggcagcaga ggctgactgt 3120 

taagaccagc ■ agcagcccct tgctggcgga gactccggct gtctctccaa ggaaggaacg 3180 

ttctggtcgc ttctggaggt ggcacctttc agaacagggg gcccaagtac ccagggct cc 3240 

cgggcccccg ggggtcctgt gggtgggatc tgacucctgc ggccatggac tgtgggcgca 3 3 00 

gaccc.ugggc ttagttcagc ucctgacggc tccccgttgt etgeggegat ctggttgctc 3360 

cggttgtctg gggatcggtg cgcctsgtcta aacctgenga caggtgggaa agcgaacccg 3420 

acagggagtc ccagggccaa atgggtctcc cagnggggag gagegggege ggcctgaggr 3480 

acgtccagct ctacccgtgg cccccccggg catcagggtc cccggtgatg gagcccaacc 3 540 

tttgtgcact gatcttccca gctgttgaca ggccccgagg aggcgtggaa ggtgaggccg 3 600 

aggcaggega ccgtcagatc tgccccggcc tggcagegge ccctgcctgc gcttcczccz 3 660 

gcctggccgg ctgttttcat cctggccctt tgagaacttc tagggtcctg gctgccrcca 3720 

atggagggtg ctggtcccat cttcttccca gctgtgccct gccgtggagc tc 3 772 



<210> 6 

<211> 1536 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 1066 
<223> n = t or c 

<221> misc_feature 
<222> 1136 
<223> n = a or g 

<221> misc_feature 
<222> 1497 
<223> n = c or a 

<400> 6 

gggggggggg ggaccactng gcccgccncc g-cccgccgc gccacttggc czgcccccgn 60 

cccgccgcgc cacttcgccc gcctccgncc cccgcccgcc gcgccacgcc tgtggccggc 120 

zcggagccgc cgcgccggcc cttgcccccc gccgcacagg agcgggacgc cgagccgcgr 180 

ccgccgcacg gggagctgca gcacctgggg eagatccaac acatcctccg ctgcggcgtc 240 

aggaaggacg accgcacggg caccggcacc ccgtcggtat tcggcatgca ggcgcgctac 300 

agcctgagag atgaattccc tctgctgaca accaaacgcg cgttctggaa gggtgttttg 360 

gaggagttgc cgtggtttat caagggaccc acaaatgcca aagagctgtc ttccaaggga 420 

gtgaaaatct gggatgccaa tggatcccga gactttttgg acagcctggg attctccacc 480 

agagaagaag gggacttggg cccagtttat ggcttccagt ggaggcattt tggggcagaa 540 

tacagagata tggaatcaga ttattcagga cagggagttg accaactgca aagagtgatt 600 

gacaccatca aaaccaaccc tgacgacaga agaat caeca cgtgcgcrtg gaatccaaga 660 

gatcttcctc tgatggcgct gcctccatgc catgccctct gecagttcta tgtggtgaac 720 

agtgagctgt cctgccagcc gtaccagaga tegggagaca ^gggectegg tgtgcctttc 780 

aacaccgcca gctacgcccr: gcccacgnac aegattgege acatcaeggg cctgaagcca 840 

ggtgactztta cacacactci gggagaugca cacattnacc -gaatcacat cgagccactg 900 

aaaattcagc ttcagegaga acccagacct: :tcccaaagc ~caggattct tcgaaaagnn 960 

gagaaaattg atgacttcaa agctgaagac ccccagactg aagggtacaa nccgcatcca 1020 

actattaaaa tggaaatggc cgtttagggt gctttcaaag gagctngaag gatattgtca 1080 

gtctttaggg gttgggctgg atgccgaggt aaaagctcct tttgctctaa aagaanaagg 1140 

aactaggtca aaaatctgtc cgrgaccnat cagttaccaa cttttaagga ugttgcGacu 1200 

ggcaaatgta actgtgccag ttct^ttccat aacaaaaggc tttgagttaa ctcactgagg 1260 

gtatctgaca atgctgaggt tatgaacaaa gtgaggagaa tgaaatgtat gtgetcttag 1320 

caaaaacatg tatgtgcatt tcaatcccac gtacttatzaa agaaggttgg tgaatttcac 1380 

aagctatttt tggaatattt ttagaatatt xtaagaattt cacaagctat tccctcaaat 1440 

ctgagggagc tgagtaacac catcgatcat gaegtagage gtggttatga actttanagt 1500 

tgttttatat gttgctataa caaagaagrg urctgc 1536 



<210> 7 
<211> 1187 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 276, 321, 534, 556 
<223> n = c or t 

<221> misc_feature 
<222> 452, 640 
<223> n = a or g 

<221> misc_feature 
<222> 492, 625 
<223> n = c or a 

<221> misc_feature 
<222> 458 

<223> nucleotide in position 458 is c, or absent 
<400> 7 

gatcgcgcca ctgcacncca gcctgggtga gagagegaga ctctgtctca aaaaaaaaaa 60 

aaaaagaccg ccagggccca aacaaaaaac 120 

:ctttttttt tztt:t:tt:"::g ggacagcccr gcnccg—gc -caggcugga gtacaacggc 180 

cggatcctgg ct:caccgcaa cctctgcctc ccaggrzcaa ^aaci zzzc cgccccagcc 240 

-cccaagtag ccaccacgcc cagccaaccc ::gnar.:::: agcagagacg ggggcctcac 300 

catgctgtcc aggccggtci: ngaactcccg acctcaggcg anccacccgc cccggccccc 360 

caaagtacra ggattacagg cgtgagccac cgcgc ccagc gccccggcgg cttttaacca 420 

agtagaaaag ccgcatcata ccacxtgctL cngtcgcncc cagcgagaac gaagaaatgg 480 

aaatgcaaac cncttattag ctgtaggaaa ^agacctcaa acagcagttn tgtngacaag 540 

accgcaggaa aacgtgggaa ctgtgccgcr ggcttagaga aggcgcggco gaccagacgg 600 

ctcccaaagg gcgcagnccc tcccngccac ogcaccrgcn cccaggtcec cgggcntccc 660 

aagactctca gctgtggccc tgggctccgc -ccgtgccac acccgcggcn cccgcgtttc 720 

cccctggcgc acgctctcca gagcgggggc cgccgcgacc ccgccgagca ggaagaggcg 780 

gagcgcggga cggccgcggg aaaaggcgcg cggaaggggt cczgccaccg cgccacttgg 840 

cccgcctccg tcccgccgcg ccacttggcc cgcctccgtc ccgccgcgcc acttcgcctg 900 

cctccgtccc ccgcccgccg cgccatgccc gtggccggcc cggagctgcc gcgccggccc 960 

ttgccccccg ccgcacagga gcgggacgcc gagccgcgcc cgccgcacgg ggagctgcag 1020 

cacccggggc agatccaaca catcctccgc zg egg eg tea ggaaggacga ccgcacgggc 1080 

accggcaccc tgtcggtatt cggcatgcag gcgcgctaca gcctgagagg tgacgccgcg 1140 

ggcccctgcg ggacgggcgg cgggaaggag ggaggcgcgg ccgggga 1187 



<210> 8 
<211> 18597 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 701, 13751 
<223> n = c or a 

<221> misc_feature 

<222> 716, 1293, 2401, 2429, 2618, 3083, 3125, 3635, 4256, 4898, 
5062, 5167, 11069, 13298, 14479, 14730, 14796, 15344, 15450, 
15503, 15590, 15840, 16149 

<223> n = a or g 

<221> misc_feature 

<222> 732, 1379, 1590, 2438, 3212, 5006, 11238, 11422, 11586, 
12598, 13171, 13645, 13782, 13806, 13813, 14536, 14788, 
15042, 15546, 15770 

<223> n = c or t 

<221> misc_feature 
<222> 1322, 1688 
<223> n = c or g 

<221> misc_feature 

<222> 2594, 11293, 16199, 16203 

<223> n = g or t 

<221> misc_feature 
<222> 3619 
<223> n = a or t 

<221> misc_feature 
<222> 14547 

<223> nucleotide in position 14547 is c, or absent 

<400> 8 

cctgtagccc cagctacgcg agaggccgag gcagcagaat :accigaacc caggaggcgg 50 

aggccgcagt: gagccgagac cgcgccaccg caccccagcc -gggcgagag agcgagaccc 120 

tgtcceaaaa aaaaaaaaaa aagaccgcca gggcccaaac aaaaaacc:: ggaaaagccc 130 

iggcggccut tttttttttt tttttttttt czrtcrggga cagncczgct ccgtcgccca 240 

ggctggagca caatggucgg atcttggccc acugcaacct ccgcccccca ggctcaagca 300 

actcccctgc cccagcctcc caagtagcca ccacgcccag ctaatc cttzg tacttttagt: 360 

agagacgggg gtttcaccat gttgtccagg crggtcrrga actccngacc tcaggtgatc 420 

cacccgcctc ggccccccaa agtactagga tcacaggcgt gagccaccgc gnccagcgcc 480 

cnggcggnct: txaatcaagt agaaaagctg cactacacca cttgcuccgg ttzgcctcagt: 540 

gagaacgaag aaatggaaat gcaaatcccc tattagntgt aggaaacaga tcncaaacag 500 

cagttttgtc gacaagaccg caggaaaacg tgggaacngt gctgccggct cagagaaggc 560 

gcggtcgacc agacggttcc caaagggcgc agcccttccc ngccaccgca cccgcntcca 720 

ggttcccggg tntcctaaga ctctcagctg tggccccggg ctccgutctg tgccacaccc 780 

gtggctcctg cgtttccccc tggcgcacgc tctctagagc gggggccgcc gcgaccccgc 840 

cgagcaggaa gaggcggagc gcgggacggc cgcgggaaaa ggcgcgcgga aggggtcctg ' 900 

ccaccgcgcc acttggcctg cctccgtccc gccgcgccac ttggcctgcc tccgtcccgc 960 

cgcgccacct cgcctgcctc cgtcccccgc ccgccgcgcc acgcctgtgg ccggctcgga 1020 

gccgccgcgc cggcccttgc cccccgccgc acaggagcgg gacgccgagc cgcgtccgcc 1080 

g.cacggggag cngcagcacc cggggcagac c:aacacatc ctccgccgcg gcgtcaggaa 1140 

ggacgaccgc acgggcaccg gcaccccgcc ggcacucggc acgcaggcgc gccacagccr 1200 

gagaggtgac gccgcgggcc cctgcgggac gggcggcggg aaggagggag gcgcggctgg 1260 

ggagagcgct cgggagccgc cgggcgccgc ggnccccgrc cagccctaac c^caaccctg 1320 

cnagggaggg gacgcatcgt cctcctcgcc ctacagacgc cgaaacggag ggtcccatna 1380 

gggacgtgac tggcgcgggc aacacacaca gcagcgacag ccgggaggta agccgcgtcc 1440 

cagcggctcc gcggccgggc tcgcagtcgc cccagcgang ccgtggcccc cgaggcgggc 1500 

gtcatcgggc agcgtttgcc cagtgctgga gggttaggga gagctgcctg ggcttgaccg 1560 

cgcgccggtc tcaaagtcct ggctttggcn ccccctccgt tttcccctgt ggaccattcc 1620 

gcttcgcagc gttttcaaaa actggagcga aagcgacgtg ggcggggcaa aggcggcggg 16 80 

aagagganag cactgaagct ggcgcgggaa crtggttticc tggtggcccc ccatccaatc 1740 

cccacgaacc agctttcctc ttaaaccttg aaaagagaaa ttcgggagtt cgagttctta 1800 

.gccgtccttt cctctttcct ttccgacagg agcaccccag gcaaaaaatg tctcgcgggc 1860 

cat.tggcgcc aggctttcag gggacagtgg ggcggggcgg ggtigggcaca ggacgttagg 1920 

cagccgttgg ccctccctaa ggccacaccg tcctgccgtc ctggatcctg cgccagctgc 1980 

3 c 5TO99 a 59 ggactcgaag gtgtgtgagc caggggccga ccttgaccgc tcagacaaat . 2040 

ggagcgcagc cctgacacag gggtggaggt ggctctgaat ggggaaaccc' acccgtggcg 2100 

aagcagactc actgtagcta gcggaaaagc cccccggccc acggacccat ctagagacga 2160 

ataca'cagca gctgctzgtgg ctgattggcg cgggacagcg tggggagctt tgcctsgagga 122 0 

gagggatcca cttttctgca gctccaagcc caggggcctt tgatgagcca tagaccccat 2280 

ttttaaccca cctttctgct tagacattga gcaagttact tctcatanag cttccctata 2340 

tgttaaaaat ggagaaaata atgcctagta ggcaatcctg ataaaagcag gtgcttgcaa 2400 

naatctctct gttgtctgaa tataaactnt accacaagcg agtgcggatg aacgaggacn 2460 

gcatttaaag ataagttttt acactttnat rtctctgtgg ctcgacactt ctgatrgcctc 2520 

cctttttgtt cctgggacac atgcttggtg ttgtcttcac acctttgtga caggattagc 2 580 

actagtgggc agtngatgat agctcctcct cccttttncc acatgttcat ccctgccctc 2640 

gccaccatct cactgtgtgg aattcctgtg tccactggtc accggggcac agaagtgctg 2 700 

tctcagcctg aatcgggcca ctgatgggac ttgcagcctg ggagctccac cgtgatctct 2760 

ggcccacttt gcgggagtct aggctttctg gacgcnccag gcctcacgtc ccagggcagu 2820 

cttcttccct gaagaaagtt ggatggcatg acctgncttc ccatcntgaa accgtatggc 2880 

aaattgtttt tcagatgaat tccctctgct gacaaccaaa cgtgtgttct ggaagggtgt 2940 

tttggaggag ttgctgtggt ttatcaaggt aaagaagtcg ctgctattag aagtcagtag 3 000 

tctgttctca acacagcagc cagtgagatc ccttcaaaac tcaaagcagc caggtgtggt 3 060 

ggctcacgcc tgtaatccca ccnctttggg aggctgagtc agatcacctg aggttaggaa 3120 

tttgngacca gcctggccaa catggcgaca ccccagcctc tactaacaac acaaaaaatt 3180 

agccaggtgc gctggtgcat gtctgtaatc cnagctactc . aggaggctga ggcatgagaa 3240 

ctgctcacga ggcggaggtt gcagtgagct gagatcgngg cactgtaccc cagccnggcg 3300 



acagagggag aacccaugrc aaaaacaaaa aaagacacca ccaaaggcca aagcaca::a j 3 60 

--ccccaccc :caagccc:: agcggcccca cutcacncag naagagccac ggccccracg 3 420 

gcgtccgcct: -tcagccc-g accttagccg ccgccccctg caccaccc:g ccgt:cc" cgc 3480 

gagtttntga gcacaccggg acanccccac cccctggaac cctcttcccc cacac—ggc 3 540 

"zcttccct t: gagcctccac cceacccggg caagccttcc tagaccnccn gacctaaaac 3600 

cgtgaccctc ccccaaccnc cccggtgrct ctccntagac gaacaccacc atccgacgra 3 560 

cgt cage etc ccccttcccc tgncagaagg gggacagcag gtagtaaaag cgaaacgcgc 3 720 

egtaagcett acgagggcag aggattngtt tcrcgtgctc actgttgtat cgccagggcc 3 78 0 

tcaaacacag cctgccacat agtaggagtc aacatatact gatcactaaa ugcagac a cc 3 340 

acctgcgttc ccatgcccac ataaattcca gaagagnctc ttcagtaaca aggtgaacec 3 900 

cctccagagg gctgagcagg tacctcaggc eggggecaga gtgctgtgaa gacagcagea 3 960 

gcccagacca agcttctccg tgttccgtgt cccggtctag aaccagegar gttctncccg 4020 

accagtgent tttggaaggc ggccgaggtc tgggctcagg tccgggccan accagaagct 4 080 

gggatccctt ctatagagca cttggtatgg cttgtatggt cctggggcaa gccagaccca 414 0 

agccctctta tcccatttta gaaagggctt caatttggat ccagccccag gtctgectta 4200 

gctctgtatt cttggggtat tttgttctgt attggecnat cttgactaac aacgancctt 4260 

ggatttgaaa catatcatca gaaacct;cag aagacaacat tcttaaactg gctagagccc 4 320 

ggtctgaatg gatgaaaagg agagacnttt gaagcaatat gtaaaagatt gagaaatgat 4 3 80 

ttgttggaaa cttctcaatt ggagaaattt ccttgactstg ttggaaattt ctttgacccc 4440 

- rencaatea aagaaaaccg ggacaaaccc aacaatagaa agggaggaag caagatactc 4 500 

agaaacaaaa tgcatncccc cgctrcaacc taacgcctca atsreaggae" ctsaaggaacc 4560 

cctgccagga acgtcagacc caccc~gata gntggagtca ccccaccggc gacccgacca -16 20 

aatacaggag ctgaggcacc tgcaccgcaa aacactgact agrccgatca craggaacac -^6 80 

cctgtangcc aggtagaaga tacattgaac agactgearg taggcactaa actcat:rt:t:g 4 740 

gggcattaca tatagacaac acacttcact aagaaacata aaactgtcag aceggeggaa 4 8 00 

tactcaaaag cacttggagg tgtttagcct, aaaaagctta gttgag'ggga anggaagaaa 4 36 0 

agatctggga gggtggttcc aaagaaggga tcagactntc ctaaagccct: caggaatctg 4 92 0 

ggctgggacc acctacttaa agataggatg ggcagctggg tgtggtggct cacgccrgta 4 980 

atcccagcac ttegggagge cgaagngggc ggatcacctg aggtcaggag ttcgaggeca 5040 

gcctgaccaa catggagaaa cnctgtctc't: actaaaaaCa caaaattagc tgggtgtagt: 5100 

ggcgcatgcc tgtaatccca getacteggg aggctgaggc aggggaatcg cttgaacctg 5160 

ggaggtngag ggtgccgtga gccacgatcg cgccattgca ccccagcctg ggcaacaaga 5220 

gegaaactet caaaaaacaa aaaaaaggat gggttccata tgggtggcgt: caagcgccca 5280 

ccccctagca agtcagcagg ggecagagge ccttgtaagt ggtgtctcgg ggggatcaac 5340 

tgagatggct taagatttac ctggacgcct gctctgctct ccccatctct; tccagggatc 5400 

cacaaatget aaagagctgt: cttccaaggg agegaaaate tgggacgeca acggatcccg 5460 

agactttttg gacagcccgg gatstccccac cagagaagaa ggggacttgg gcccagccta 5520 

rggctnccag tggaggcatt ttggggcaga acacagagat atggaaccag gtgaggagat 5 5 80 

agaacaaege ctrcccanttc cgggcgcccc tcccagcacg tgtttgctcc gctgtttrag 5640 

ataaggtctg ggggatgagt caatgtcaca ggagctgatg tatagctttg accttgtgag 5 700 

gggtggtgcc aggttgaagc cacaattaac gectactgaa ggccgtttca catcttttztt 5 760 

tttttttttt ttttaattat tatacnttaa gttttagggt acatgtgcac aatgtgcagg 5820 

ttagttacat atgtatacat gtgccacgct ggtgcgctgc accactaact caccatctag 5380 

catcaggtat atctcccaat gctatccctc ccccctcctc ccaccccaca acatccccag 5940 

agtgtgatgt tccccttcct gtgtccatat gttctcgttg ttcgattccc actatgagcg 5000 

agaatatgeg gtgtttggtt ttttgttctt gcgatagttt actgagaatg atgatttcca 6060 

cttcaccacg tccctacaga ggacatgaac tcatcatttt ttatggctgc atagtattcc 6120 

atggtgtata tgtgccacat tttcttaatc cagtctatca tgttggacat ttgggttggt 6180 

tccaagtctt . tgcctattgt gaatagegee acaataaaca tacgtgtgca tgtgtcttta 5240 

tagcagcatg atttaatagt cctttgggta tatacccagt aatgggatgg ctgggtcaaa 63 00 

tggtatttct agctctagat ccccgaggaa tcgccacact gacttccaca atggttgaac 5360 

tagtttacag ccccaccaac agtgtcaaag tgtcctattt ctccacatcc tctccagcac 542 0 

ctgttgtttc ctgacttttt aatgactgee actctaactg gtgcgagacg gtatctcatt 5480 

gtggttttga tttgcgtttc tetgatggee agcgatggtg agcatttttt catgtgtttt 554 0 

ttggctgcat aaatgtcttc ttttgagaag tgtctgttca tgtccttcgc ccactttttg 5600 

atggggttgt ttttttctca taaatttgtt tgagttcatt gtagattctg gatattagee 5660 

ctttgtcaga tgagtaggtt gcaaaaatgt. tctcccattt tgtgggttgc ctgttcaccc 6720 



tigacggisagi: ~:cctttgcr g-gcagaagc :. ::::ag:-: aactagaccc "acttgccaa -730 

ctttggcttt cgttgccacu gccc~ tggca caggca-gaa gccct^gccc acgcctatigt; c 3 4 0 

cccgaacggt aacgcc^agg c::::::::a gggcnr r r at ggtttraggt ctaacgctica 5 900 

agtscrrtaac ccarccrgaa ucgacctttg tacaaggcgt; aaggaaggga tccagttcca 5360 

gcttt-taca cacggctagc eager rrccc agcaccaccc actacatagg gaatccuttc 7Q20 

cccatrgctt gtctttccca ggcc-gtcaa agaccagaca gttgtagaua cgcggcg-ca "08 0 

cttctgaggg ctctgctccg utccactgat ccatgtgcct gttttggcac cagtaccata ^140 

ctgttutggt tactgtagee ctgtagcata gtttgaagee aggtagegeg atgcctccag "200 

ctttgrtctr ttggcttagg acngacctgg egatgeggge tctctttztgg ttccacatga ^260 

actttaaagt agctttttcc aactzcugtga agaaagtcan tggtagcctg atggggangg 7 32G 

cattgaatct ataaattacc tegggcagta eggecanntt cacgatattg actcttccta ^380 

cccacgagca tggaatgguc tzccauttcc ttgcatcctc ttttatttca ttgagcagtg 7440 

gtttgcagtt ctccttgaag aggcccccca catccctctt aaggtggatt cctaggtact ?500 

ttattctctt tgaagcaacn gtgagtiggaa gttcacucac gatttzggctc tctgtttgtc 7560 

tgttattggt gtataagaat gcctgtgatt tittgeagatt gattttatat cctgagactt 7520 

tgetgaaget gcttatcagc ttaaggagac ettgggcega gacaatgggg ttttctagat 7680 

atacaatcat gccgnctgca aacagggaca atttgacctc ctcttttcct aattgaatac 7740 

cctttatttc cttctcctgc ccaattgccc tggecagaac ttccaacact atgttgaata 7800 

ggagtggtga gagagggcat ccccgtcttg tgccagcctt caaagggaat gcttccagtt: 7 8 60 

tttgeccatt cactatgaca ctggcugtgg cc-tgtcata gatagctcrt artattttga 7920 

aataegtrnce atcaataccx aacctactga gagectrcag cacgacgtgt tgttgaac.tt "98 0 

egtcaaagge ccnttctgca cctatztgaga taatcacgcg gcttccgcct titzggauctgn 3040 

etatangecg gattacactt accgacttgc gtatatugaa ccagccttgc auccxaggga 3100 

tgaagcccac acgatcatgg eggataagee ctttgangtg ' ctgctggatt cggttcgcca 3160 

gtattttatt gaggattttrt gcatcaatgt tcatcaagga cattggtcta aaattctctr 3220 

ttttggcgcg tctctgccca gccccggtau caggacgatg ttggcutcat aaaatgagtt 3280 

agggaggatt ccctcttttt ctat^tgatrg gaatagtctc agaaggaatg gtaccagtt.c S340 

ctctttgtac ctctggagaa ttcggctgtg aatccacctg gtcctggact ctctttggtt 3400 

ggtaagctat tgattattgc cacaatttca gctcctgtta t tggtctatt cag.agattca 3460 

acttcttcct ggtttagtct tgggagagtg tacgtgtcaa ggaatttatc catttcttcc 3520 

agattttcta gtttatttgc gtagaggtgt ctgtagcaat ctctgatggt agtttgtatt 3580 

tctgtgggat cggtggtgat atccccttta tcatttttta ttgegtctat ttgattcttc 8640 

tctttttctt tattagtctt getageggtc tataaatttt. gttgatcctt tcaaaaaacc 3700 

agctcctgga tccattaact ttttgaaggg ctnttcgugt ctctatxtcc ttcagttctg . 3760 

ctctgatttt agttatttct tgccttctgc tagcttttga atatgtttgc tettgetttn 3820 

ctagttcttt taattgrgac g.ctagggtgt caatrttgga tctttcctgc tttctcttgc 3880 

gggcatttag ugctataaac ttccctctac acacngccc.t . gaatgtgccc cagaggttc:: 3940 

ggnatgttgt gtctttgetc ntgttggctt caaagaacat ctttatttct gecttcattt: . 9000 

cgctatgtac ccagtagcca cccaggagca ggttgtzccag tttccangta gttgagcagt: 9060 

tttgagtgag attcttaatc ctgagttcta gtttgactgc actgtggcct gagagatagt 9120 

ttgttataat ttctgttctt teacatttge tgaggagagc tttacttcca actatgtggc 9180 

cggttttgga ataggtgtgg tgtggtgctg aaaaaaatgt atattctgtt gatttgggat 9240 

ggagttctgt agatgtctat taggtctget tggtgcagag ctgagttcaa ttcctgggta 9300 

tccttgttga ctttctgtct cgttgatctg tgtaccgttg acagtgggtg ttaaagtctc 9360 

ccattattaa tgtgtggagt ctaagtctct ttgtaggtca ctcagatgat .tggcacttac 9420 

tgggcgcttg gcactttcca tactgtgtca . teggcagata gctgcatggt tggtgttcgt 9480 

gctggggaat gggaagttca tcggtgggac aaggacaaaa tgcccccatt gctttgttgt 954 0 

ggctttaatc tccctttcga ggctgageca cagcgtgctg taggtggcgc tgctgtzgaag 9600 

cgcagtacca gggtcacact ccactcccag etctgeagag gtggagaaag aatgaaacau 9660 

ctcactcctg gacttccac.t ttcctgtcac tgttggtgtc acctcttact ggatgtcaca 9720 

gagcccagcc cctcccacct gtgcctagga aaagcagatg ccaccttgga atgtggggtzc 978 0 

tgtgtgtgca atttactagc tgggcagaga ccagcaacct ggagagcagg tgtctegtet 9340 

aaggggacag tcacatctca cctccagcca cctggaggaa tttgggcctg gtgatgtcag 9900 

aattcttcaa taaaagecta aaatctatat tttatgtgcg gtcatgagat ctgttaaatg 9960 

ttagcaactt caggaagttt aaaaatgctg tgtggaccta gaataggcaa gttcttaaag 1002 0 

gcagaaagtg gaatgctagt ttccagggac tggggaacag ggaggaaugg ggagttcatg 1008 0 

tttaatgggc acagaggttt tgttagggat gacgaaaaag ttegggagat ggcgatggcg 10140 




r.zggagangg cgatggcgac ggagacggcg a :: ggnga-gg zgatggrgat gggtgacggc 1 2200 

gacggegacg gtgatggcga cggagacggc ganggcgacg gcgacggaga cggtganggc 1026 0 

gacggcgacg gtgacggaga cggcganggn gacggagacg gcganggnga uggcgatgga 10320 

ga cggr gang gtgaeggega cggcgatggt gacggzgacg gegaeggaga tggagacggt 103 80 

gaeggtgang gttgcccaac ancaggaacg zgccnaacgc tcctgaaccg cacacaaaaa 13440 

cggcaagttt aatattatgt gcactttacc acaacgaaaa aagccgcngc gtgggccaag. 10500 

-tacttgtgc aggtaargtt ergcaggegg crgcccgcac cccagc-gta gggtgcccgt 10560 

aggatgtgag gccagccccc gggcttaacg acgccncaaa ccctgcctag tattcaatta 10520 

etnettgteg cttaaaaggc ccaacaaaac cacggcecna gtttacagtg gratgaatgc 10680 

tcagctgttg gattttagta ggaaagttcg cccccuetcg cttttaactt tgttttacag 10740 

attcacagga attttttttt tttttttcct cnttttttnn caatgeacag aaagtttccc 10800 

tzggactctct acccagtrtc cccagtgata atatcttggg caacaccctg tatacattca 10860 

cactggcgca ttcctcag.ag ttgtcagatt ttgcragntt tacgtgcact tgtgtatgtg 10920 

egtatttgea attttagcac gtgtagacce ttgcaaccac cacaatcaag ttacagaact 10980 

acactaccaa' ggttcacctt tttaaaatct ttgatgttac crtttttgga acagtgacca 11040 

tgagaggact ttcctcccaa aattttgana accactgaac cagaatatag tctgacacta 11100 

ataggtagaa atttaaccaa aggagattat gaagccctgc acttrgagtta acaaaatcac 11160 

ttctcagctt ccagttccat ctcagaagga aggaaaaggg actaaaaatc cagagaccag 11220 

aaaatgggag caaagtanaa ggcggtgtaa tcattacaga ggttccctga tgtttccaag 11280 

ccagtcgtgt gengagctge taaaccctaa agcaacttta ggtggaatgt tggaaacatg 11340 

ergctgaggt gatagaaagg aanccanggc oczczgzzag ucggaaagta tatggaanac 11400 

-acattctac ataagataca anactcccrg cgagacaagg acaaagcaga ttttgtcagr 11460 

gaaattgnga caagaaccgc tgaegggert agagecuaag ertgegagga gcactggaag 1152 0 

aaattaagat tgttgagatt ggaaagggtn agetaegggg gaacaggagg aggtgactcc 115 80 

atgacagacc aaatattcaa aggacngtgt agaagaggaa aaagaccttg ttaggjgctcc 11640 
agaggacaga gecaggagtc agacagggee ctrgaactcaa cccacngaga tetgeaaact . 11700 

ttgcaggatg caccagatgt cctcjtagcca egggtcaagg ggggaccctg ggtaagagac 11760 

tgtaatagat gacctccaag gccatctcat gacatgtgcg attaatgtat gtacctgtcc 1182 0 

tctctttttg acaattctac agattattca ggacagggag ctgaccaact gcaaagagtg 11880 

attgacacca tcaaaaccaa ccctgacgac agaagaatca tcatgtgcgc ttggaatcca 11940 

agaggttgaa agaaccccgt cgncttcatt tatactaacc atactcttag agggaagcaa 12000 

tctggttttg tgeagaggea ctgagggagg caggaccccg ggcaacttcc cccagccaca 12060 

tggttgtgtg acgttgggca agtcacantt tgetgeaett tcaccrtcag atcatgaggt 12120 

tgggeccaga ggattttttt tttttttttt ttttttgaga cagagttttg ctctgttgcc 12180 

caggctggaa tgcaacggcg tgatcttggc tcactgtaac ctctgcctcc tgggttcgag 12240 

tgattctcct gcctcagcct ccaagtagct gggattacag catgtgccac catgcctggc 12300 

caattttgta tttttagtag agaegggute acatgttgge caggcnggcc ttgacccctg 12360 

acc.ctcagat gatctgeett gccrcagcct cccaaccgag cgatcetaag ttgtgtatta 12420 

cactcattcn tacacaaaaa gggctttaaa tgcccagaaa ccacacgaag aegttaacac 12480 

cttaaatgga agcagatgaa gttccagctc gctgccacct cactaacatt tttaacaatt 12540 

atattgtaaa attcaactct accagggtgt agagecaggt gtggtggctc acacctgnaa 12600 

ctccaacaac tccagaggcc aaggegagag gatcatttga acccacggaa tttgaggctg 12660 

tagtgagtca tzgatcacgcc attgcactcc atcctgggca acagagtgag accctgaata 12720 

tttaaaaaca acaacaacaa caaaactcta tcaggatatc ataagtactt agagtgaaat 12780 

acttgeatet gtaatagaga cttatttttt ttttttttga gacacagtct caccctgttg 12840 

cccaggctgg agtgcagtgg tttgatctcc gctcacggca acctccatct cccaggttca 12900 

agtgagttcc cattcctcag ccccagagct gggaccacag gcgcgcgaat ttttgtattt 12 960 

ttagcagaga eggggtttea ctatgttggc caggctagtc tcaaactcaa gttggcctca 13 02 0 

agtgatctgc . ccaccctggc gtcccagtgt tgggatttca ggcatgagee actgtgcctg 13080 

gccatgtaat agagactttt aatataggag ggtgtaccag aagcaccagt ttcctgtggc 13140 

aaacagaact attcctgetg tatctgtaac ntggtgccac gaggtagece agatccct.tc 13 200 

agctctgatg gaagagcatt gcttcagccg taaatggaca ectgeagaaa ccttgcaccg 13 260 

atggatagtc tccctcagct cSgtgccatc getgeagngg ctgttatgga catcactgca 13320 

gcccagtggc tctctctcct ggtctccacc atatgagttg gcttctgttt. ctctcctgtt 13380 

ttactttgee tttagctgtg gtctttcaaa ccaccatccc tccttatctt cctctgctgg 13440 

ttcctcagat cttcctctga cggcgctgcc tccatgccat: gccctctgcc agttctatgt 13500 

ggtgaacagt gagccgtcct gccagctgta ccagagatcg ggagacatgg gcctcggtgt 13 560 




j.-rczr ccaac accgccagcz acgcccrgc:: caegzacaig ~ ~ "gcgcaca ::acgggccr 13620 

gaaggcgggc cgtctcggga agggngacct gccagccnac cacracgagcr ::tcagcc:c 13680 

- ^aacatggg aaaacaaacc gcagagctta gccrcrgac" agcxcccaaa z-tgatacgc 13740 

g-aagnaaga natgaaccag cttttacctt gaaaccc:cc cnrtccggaa ggntttccgg 13800 

-crcgnggta nangcaccaa cagatctaca caggticgczt gcgacacagc ctatggac 13360 

:::ci:caaaa gccatgcnga ggttgggtac ggtggcccat gcccgnaacc ccagcacntt 13920 

ggaagaccga gacaggagca attgcutgag gccrggagct caacaccagc ctgggcaaca 13 980 

zaacaagatg cugttgccac aaaaaaacgg aaaagccaca c:aaautatc nttttaaaaa 14040 

aagccttigcg gtgtctgcac attctaatgt tttnaaacga cgctntiaaag aattgaaacr 14100 

aacacactgt tctgcttrcn cccggcttat agccaggcga ctrtiacacac actttgggag 14160 

acgcacatat ttacctgaac cacatcgagc cactgaaaat ccaggtaaga actagatgtt 14220 

auacctttgg gtttggcacc ttctcttgat aaaaggncga ccgtggaaca ggtatctgct 14280 

caatgctgtg tccaagataa agatgactgc cccaaatgtg gggctc-agc ttagggagaa 14340 

gnggtzgggca ggtgggcagg acaaggcagg catctgcccc agcaaccacg gcacttaact 14400 

tgccaggtgc tgtgaggnac taagcaccag taccagagag ggaagagcca cattcaagcc 14460 

aggggattgt ccaaaaggng gcattttaac tcactttaac ttgaaggaga attgaagtgc 14520 

aaatgttttt ccttttcttt ttttttgnag atggagtctt cctctgrcgg ccaggctgga 14580 

gcgtgncgtg gtgcgatctc agctcactgc aacctccacc teccgggttc aagcaattct 14640 

tctgccncag cctcccaggt agctgggatt acaggcacat gccacracac ccagctaatt 14700 

3-Ctgtatta . ttagcagaga tggggnttcn tcatgiztggc caggcrgatc ccaaacccct 14760 

•jacctcaagc gtaccaccrg cctcagcntc cgaaancncc ggaaccacag gcacaagcca 14 820 

^cacccnggc cataaauac: ctttgntaac trnacaxcaa gnacaacacc raggtccaaa 14880 

cntcaaaagt ccgttgaaac ccctgaagtt: auageagcca acaactgaca cgaaatggca 14940 

ataaaaatgt aagttcatcc gcttcatgag ccttaaggaa aaaaactcag aaccagacac 15000 

cctttagccc cttccaggtt agatccaggc tttaaaagct antcctttga gggagtttgg 15060 

ccgcctttga gtggaggtga cttcaggctr attctctctg gccctctgcr ctzggtcattt 15120 

ttagacatag taataggttg tga^ctgtct tcacatccta attgccactg tctgttcatc 1518 0 

ccaggaatcc tggctttcat ccctttctgt tcactgtcca tgcatgtcat ctttccttct 15240 

ctctgccagg gaccagangg gttagggatt gtgaattcaa gcaaacgtag agctactatg 153 00 

agttacagat tgactgtgtt cctgtcttta acaaatttgc caanagtggt tataagaact 1536 0 

tacacctgat gaggcaccag gctcctgacg ctgtgtaacg tcacaaaata cccctcactc 15420 

tcgatctgtg caagagaaca gctggttgcn ctccaatcat . gttacataac cta'cgcgaag 15480 

gtatcgacag gatcatactc ctntaaaata gaactttgtt gatcacatcc cgtgtacrtg 15540 

tttcanggac atgaggagca attacaacag gtcgtacaat tatggcaaan taatggcctt 15600 

attttgtttt tagcttcagc gagaacccag acctttccca aagctcagga utcttcgaaa 15660 

agttgagaaa attgatgact tcaaagccga agaccttcag' attgaagggt acaatccgca 15 720 

- ecaactatt aaaatggaaa tggctigttta gggtgccttc . . aaaggagctn gaaggatant 15780 

gt; cage cut t aggggtxggg ctggacgccg aggtaaaagt tctttttgct ccaaaagaan 15840 

aaggaactag gtcaaaaacc tgcccgtgac ctatcagcra ctaactttta aggatgttgc 15900 

cactggcaaa tgtaacngtg ccagttcttt ccataacaaa aggctttgag ttaactcact 15960 

gagggcatct gacaatgctg aggttatgaa caaagtgagg agaatgaaat gtatgtgctc 16020 

ctagcaaaaa catgtatgtg catttcaatc ccacgtacct ataaagaagg ttggtgaatt 16080 

tcacaagcta tttttggaat atttttagaa tattttaaga atctcacaag ctattccctc 16140 

aaatctgang gagctgagta acaccatcga tcatgatgta gagtgtggtt acgaacttna 16200 

aanttatagt tgttttatat gttgctataa taaagaagtg ttctgcattc gtccacgctt 16260 

tgttcattct gtactgccac ttatctgctc agttccttcc taaaatagat taaagaactc 16320 

tccttaagta aacatgtgct gtattctggt ttggatgcta cttaaaagag tatattttag 16380 

aaataatagt gaatatattt tgccctattt ttctcatttt aactgcatct tatcctcaaa 16440 

atataatgac catttaggat agagcttttt tttttttttt ttaaaccttt ataaccttaa 16500 

agggttattt taaaataatc tatggactac cattttg'ccc tcattagctt cagcatggtg 16560 

tgacttctct aataatacgc ctagattaag caaggaaaag atgcaaaacc acttcggggt: 16620 

taatcagtga aatatttttc ccttcgttgc ataccagata cccccggtgt ngcacgacta 16680 

tttttattct gctaatttat gacaagtgtt aaacagaaca aggaattatt ccaacaagnt 16740 

atgcaacatg ttgcttattt tcaaattaca gtttaatgtc taggtgccag cccttgatat 16800 

agctattttt gtaagaacat cctcctggac tttgggttag ttaaatctaa acttatttaa 16860 

ggattaagta ggataacgtg cattgatttg ccaaaagaat caagtaataa ttacttagcn 16920 

gattcccgag ggtggcatga cttctagcng aactcanctt gaccggtagg attttttaaa 16980 




-cca" czttg naaaactzact: tccaagaaac "tuaagcccc ::cacc:cac: ^aagaaaaaa 17040 

gttgtzcgggg ctgagcacnr aattttctcg agcaggaagg agcttccccc aaacctcaec 17100 

acctggagac nggtgtttct ttacagattc ctccnucacr tctgtcgagc agccgggatc 17160 

ctaccaaaga ccaaaaaaac gagccctgce aacaaccacc nggaacaaaa acagacccta 17220 

tzgcactracg ccgctccaag aaacgccttn acgtctiaagc cagaggcaac :aacnaautt 17280 

rttcttcctt gacacggagt caccgtccgt: tgcccaggct gcagcgcagc ggcgcaaccc 17340 

tggcncacug caacctccac ctcccaggnn caagtgaccc ccctgcctca gccxcccatg 17400 

tagcngggat cacaggcacc cgccaccatg cccggccaat tttttgcaun cctcgnagag 17460 

acagggtzcic accatgttgg ccaggcnggc crcaaacacc tgacctcaaa cgacccaccc 17520 

gccrcagcct cccaaagtgt tgggautaca ggcgcaagcc accatgccca gcccigaacu 17580 

aatactttta aaataagttt ggagactgtr ggaaataata gggcagagga acacatttta 17640 

ctggctacct gccagagtta gctaacncac caaacccttt gataacagtt tgacctcrgt 17700 

tggtgaaaat gagccatgat ctcctgaaca tgatcagaat aaatgcccca gccacacaat 17760 

cgtagtccaa actttttagg tcaccaactn gctagatggt gccaggtttt tgcacaag 17820 

gagtgcaaat gttaagatct ccactagtga ggaaaggcna gtattacaga agccctgcca 17880 

gaggcaattg aacctccaag ccctggccct caggcctgag gattttrgata cagacaaact 17940 

gaagaaccgt ttgttagtgg atattgcaaa caaacaggag tcaaagcctg grgctccaca 18000 

gtctagttca cgagacaggc gtggcagtgg ctggcagcat ctcttctcac aggggccccc 18060 

aggcacagct taccttggga ggcatgtagg aagcccgcng gatcatcacg ggatacttga 13120 

aacgcccatg caggtggtca acatactcac acacccnagg aggagggaac cagaccgggg 13180 

-aatgacgcc cgaagtcaga ntatncacgc ggrgcraact taaagcagaa ggagcgagca 13240 

ccacucaatt gacagtgrtg gccaaggccc agccgrgcca ccacgcgntc ctiaggcaagi: 18300 

ccccaaacct cngtgcccca ggtccttttc ctctaaaaca tagcaangtg aggcggggac 13360 

cttgatgaca tgaacacacg aagtccctcr gagaggnrtc gtggtigccct: ctaaaaggga 13420 

tcaactcaga ctctgtaaat atccagaatt atttgggczc ctctggccaa aagtcagatrg 18480 

aatagactaa aatcaccaca ttttgtgatc catttttcaa gaagcgtutg cattttttca 13540 

tatggctgca gcagctgcca ggggcctggg gtttttccgg caggtagggt cgggagg 18597 
<400> 9 

cccaggcgca gccaatggga agggtcggag gcatggcaca gccaatggga agggccgggg 60 

caccaaagcc aatgggaagg gccgggagcg cgcggcgcgg gagatttaaa ggccgctgga 120 



gcgagggncc gcccgrgcac z-zzgzcczaa ccrgccccgcc ctggccgcuc ^czczgczzz 130 

gctgcgccnc cactatgccc cccccccgcg ccccgctcgc gcccaccacg gacccgcagc 240 

agccgcagcc ctcgccgcrg aaggggccca gcccggucga caaggagaac acgccgccgg ^ 00 

ccccgagcgg gacccgcgcc ccggccagca agaccgcgag gaggatctcc caggagccca 3 60 

-9"9" a gccgaa aactaaagca gctgcccccg gcgcggagga cgagccgccg cugagagaaa 420 

acccccgccg ccttgccacc ttccccaccg agcaccatga catctggcag atgtacaaga 480 

aggcagaggc tncccti^^gg accgccgagg aggttgacct ctcnaaggac atccagcact: 540 

gggaaccccc gaaacccgag gagagacact ttatatccca tgntctggcc ccctttgcag 500 

caagcgatgg cacagtaaat gaaaacttgg tggagcgact tagccaagaa gttcagattza 6 60 

cagaagcccg ctgtttczat ggcttccaaa ctgccacgga aaacatacat tccgaaacg- 720 

atagccttct tattgacact tacataaaag atcccaaaga aagggaattt ctcttcaang 730 

ccattgaaac gatgccrtgt gtcaagaaga aggc.agactg ggccttgcgc tggatcgggg 340 

acaaagaggc tacctarggn gaacgtgttg tagcctttgc tgcagcggaa ggcattttct 900 

tttccggttc ttttgcgtcg atattctggc tcaagaaacg aggactgatg cctggcctca 960 

cattttctaa tgaacttatt agcagagatg agggtttaca ctgtgatttt gcttgcctga 1020 

tgttcaaaca cctggnacac aaaccatcgg aggagagagt aagagaaata attatcaang 1080 

ctgttcggat agaacaggag ttcctcactg aggccttgcc tgtgaagctc attgggatga 1140 

attgcactct aatgaagcaa tacattgagt ttgtggcaga cagacttatg ctggaactgg 12 00 

gttttagcaa ggttttcaga gtagagaacc catttgactx tatggagaat atttcactgg 1260 

aaggaaagac taactr cctt gagaagagag caggcgagca tcagaggatg ggagtgatgt 1320 

caagtccaac agagaacncn cccaccccgg angccgacrt ctaaacgaac tgaagacgng 13 80 

cccttacctg gczgazzznz n-titcccatc tcacaagaaa aatcagccga agtgttacca 1440 

accagccaca ccatgaactg cccnuaatgt tcattaacag catctttaaa actgtgtagc 1500 

tacctcacaa ccagtcccgt ctgtttatag tgccgguagu atcacctttt gccagaaggc 1560 

ctggctggct gtgaccracc atagcagtga caatggcagt cttggcttta aagtgagggg 1620 

tgacccttta gtgagnttag cacagcggga ttaaacagtc ctttaaccag cacagccagt 1680 

taaaagatgc agcctcactg cttc^aacgca gattttaatg tttacttaaa cataaacncg 1740 

gcactttaca aacaaataaa cattgttttg tactcacggc ggcgataata gcttgattta 18 00 

tttggtttct acaccaaata cattctcctg accactaacg ggagccaatt cacaattcac 1860 

taagtgacta aagtaagcta aacttgtgta gactaagcat gtaattttta agttttattt 1920 

taatgaatta aaatarttgt taaccaactt taaagtcagt cctgtgtata cctagacatc 1980 

agtcagttgg tgccagatag aagacaggtt gtgtttttat cctgtggctt gtgtagtgtc 2040 

ctgggattct ctgccccctc tgagtagagt gttgtgggat aaaggaatct ctcagggcaa 2100 

ggagcttctt aagttaaatc actagaaatt taggggtgat ctgggccttc atatgtgcga 2160 

gaagccgttt cattttattt ctcactgtat tttcctcaac gtctggttga tgagaaaaaa 222 0 

ttcttgaaga gttttcacat gtgggagcta aggtagnant gtaaaatttc aagtcatcct. 2280 

taaacaaaat gatccaccta agatcttgcc . ccngctaagc ggtgaaacca acnagaggtg 2340 

gttccuacaa gttgtt-cant ccagttttgt tcggtgcaag taggntgtgc gagttaactc 2400 

atttatattt actatgnctg rtaaatcaga aactttttat tatctatgtt ctnccagacu 2460 

ttacccgtag ttcataaaaa aaaaaaaaaa aaaaaaaaaa 2 500 
<400> 10 

acggggcrcg gggccgggcg gccagacgct: ' aactcggatg cticccaggcc acgccntggc 50 

catgacccgt gcggccgcgc gcccccgc:: r caccnccgg cgcgcgcttc cccacgcagc 120 

agacgacgtg cggc'cccggg ccaggccacc tggtgccogc tcgcatgacc gcgcgcggca ■ 130 

ccnacggcgc ccccgcccac cccatctaeg gccgcccacg ccgctcagcg cccctcctca 240 

ctccgggacc tggtcaggac cc ccgggccc ccggccaccc caacgccgaa ctgcgtccag 300 

ggaggcccac ctgggaaccc ccgacccgaa ccccgagccc ccctcggata ccctaacacg 360 

atattcggta cccccacatc cggatcccaa accccaaacc ccgaacccac ggggctttga 420 

caaatcgtgg ctcagacccc ccactagncc eaggacccca tctcgggtac ccaccaggct: 480 

ccnacgcagt tctagccccc cacacccttg acccgccrcg caggcaggta cttcccggag- 540 

cgagcgggga acgcgacgta ccccagtgcg cctcggcaca ccattgctcc ccgaaactgg 600

ngtgtccagg cggaacagca gagcccaggn cccgcggcct atacggtgcc ctcgctcttg 660 

ggtccgcgcg tcatcggcaa agtctccgcc ccaacccgct ccatctacgg ccgcagagcg 720 

gctggcagtt tcttcgagga cctcagcaag gtcgtgagtc caggggncta caagtcccgg 780 

gccccccagt tcacgattct ggcgcggact tcgctccccc aagacaacac tcggaagcca 840 

gggcccgcgg cctacaacgt ggatcagcac cggaagcccc gcggctggag tttcgggatc 900 

cggcactcgg actacctggc cccgccggtg accgacgcgg acaactgacc cgccaggcgg 960 

gagcggcccc acacgtgttt gcctaaagtc cgcgagtccg catcgrgtcc gcctctctct 1020 

ctctctctct gcgcgnccng gcgcaaggcc -ggggcggag ccacggctgg ggccgtgtcc 1080 

caactccgaa cccagcgggg cggggcccga gcgncgggcg aggccgggac cccagcgctg 1140 

cgccgcgtcc gaacgtcgag accccaccga gggcgggagg gggaccctcg ggagccacag 1200 

acgcccgaga cccacgccgg gcgggaccgg ccagggacca cccccgccga cggccccggg 1260

ccccgacggc ccggaagttc cgcgtgcccg ggggcaccng gggactggcc ggggcgcggc 1320 

gtgcaaggct tcccgggggc ggcgactgcc gagctccgcc ctccaggcgg ccccacccgc 1380

ctgccgtcct ggggcgccgc cgcc^ccgccg ccggcagtgg accgctgtgc gcgaaccctg 1440

aaccctacgg tcccgacccg cgggcgaggc cgggtacctg ggctgggatc cggagcaagc 1500 

gg^cgagggc agcgccccaa gcaggtacgg gcggggctca agtcgcgagg cggggaagcg 1560 

ggaggcagac acggacgagg gcgacacaga cacgggaccg aggggcggac accggagaga 1620 

cacgggaaag gggtcgggac aggagcacgt ggctcagaca ccgacgccgg gaggccgcag 1680 

accccggacg tgtcaggcat ccccgcaggc ccggagcg 1718 



<210> 11 
<211> 5847 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<222> 124, 3346, 5024, 5484, 5650

<223> n = c or t
<221> misc_feature

<222> 439, 1333, 1979, 2151, 2469, 2977, 4784, 5268, 5631, 5733 
<223> n = g or a 

<221> misc_feature 
<222> 1045 

<223> nucleotide at position 1045 is c, or absent

<221> misc_feature
<222> 1046

<223> nucleotide at position 1046 is t, or absent

<221> misc_feature
<222> 2636, 5287 



<223> n = c or g

<221> misc_feature
<222> 3118
<223> n = g or t

<221> misc_feature
<222> 3257, 4053
<223> n = a or c

<221> misc_feature
<222> 5440
<223> n = t or a



<400> 11 

gatattcggt accccatatc cggatctcaa atcccaaacc ccgaacccca cggggctttg 60 

ataaatcgtg gctcagactc cccaccagtc ccaggacccc atctcgggta cccaccaggc 120 

tccnacgcag ttctagcccc ccacaccctt gacccgcccc gcaggcaggn acttcccgga 180 

gcgagcgggg aacgcgacgt accccagtgc gcctcggcac accattgctc cccgaaactg 240 

gggtgtccag gcggaacagc agagcccagg cgaggtcaga acggcccacc ccagaactgt 300

gggccrticcc actcgagacc ggggaccgcc c-ccgggagc tzgggaccacc ccgcgccrgt 360

c.cgcggagac ccactacccc cgagccccgc czcczcccca ggncccgcgg cctiatacggt 420

gccct.cgcxc ttgggtccnc gcgccatcgg caaagtctcc gccccaacct gccccatcna 480 

cggccgcaga gcggctggca gtttcttcga ggaccccagc aaggcggggg aggggccggg 540 

gcggacgcag ggggtccctg gtccgcggca gcggaggcgg cagccagcac cctctgccct 600 

cccgcagacc ccgggcccct gcgc'cnatca ggccgtgagt: ccagggguct acaagccccg 660 

ggccccccag ttcacgattc tggccjcggac ttcgctcccc caagacaaca ctcggaagcc 720 

agggcccgcg gcctacaacg tggatcaggt ggcctggagc ccagggrcaa gggncagagt 780 

caggagagtg gggagggcct gaggtcggag tgacgggacc agagtccccg ggggtccagg 840 

ggtcccggcg cggagaggat gccggccccg cgaggtcagc ggtgtczccg ggcccgcagc 900

accggaagcc ccgcggctgg agcttcggga tccggcacnc ggaccacccg gccccgctgg 960 

tgaccgacgc ggacaactga cccgccaggc gggagcggcc ccacacgtgt ttgcttaaag 1020 

tctgcgagtc cgcatcgtgt ccgcnnctct ctctctctct ctcrgcgcgt cctggcgcaa 1080 

ggcctggggc ggagccacgg ctggggccgt gncccaactc cgaacccagc ggggcggggc 1140 

ccgagcgtcg ggcgaggccg ggaccccagc gccgcgccgc gtccgaacgt cgagacccca 1200

ccgagggcgg gagggggact cccgggagcc acagacgccc gagacccacg ccgggcggga 1260 

ccggccaggg atcacccccg ccgacggccc cgggccccga cggcccggaa gccccgcgtg 1320

^ccgggggca ccnggggatt ggccggggcg cggcgcgcaa ggcttcccgg gggcggcgac 1380

tgccgagctc cgccctccag gcggccccac ccgccrgccg tcccggggcg ccgccgcccc 1440 

gccgccggca gtggaccgct gtgcgcgaac cctgaaccct acggtcccga cccgcgggcg 1500 

aggccgggta cctgggctgg gatccggagc aagcgggcga gggcagcgcc ctaagcaggt 1560 

acgggcgggg ctcaagtcgc gaggcgggga agcgggaggc agacacggac gagggcgaca 1620

cagacacggg accgaggggc ggacaccgga gagacacggg aaaggggtcg ggacaggagc 1680 

acgtggctca gacaccgacg ccgggaggcc gcagaccccg gacgtgtcag gcatccccgc 1740 

aggcccggag cgatggcagc cttgatgacc ccgggaaccg gggccccacc cgcgcctggt 1800

gacttctccg gggaagggag ccagggactt cccgaccctt cgccagagcc caagcagctc 1860 

ccggagctga tccgcatgaa gcgagacgga ggccgcctga gcgaagcgga catcaggggc 1920 

ttcgtggccg ctgtggtgaa tgggagcgcg cagggcgcac agatcggtgc gtggggagng 1980 

ttgggcgttc ctgaccccga ctgggaggcc agcccgagag actttgggtc cctgggggtg 2040 

cgacggtgcc ccactaccag caccggcccc agggtgcccc accgctgrgg gcrgccaccc 2100 

tcacgcgtac ccccacatac caggggccat gccgatggcc atccgacutc ngggcatgga 2160

tctggaggag acctcggtgc tgacccaggc cctggcncag tcgggacagc agctggagtg 2220 

gccagaggcc tggcgccagc agcctgtgga caagcattcc acagggggcg tgggngacaa 2280 

ggtcagcctg gtcctcgcac ctgccctggc ggcatgtggc tgcaaggtta gaaaccacct 2340

cctttccaga cgggagccta taccgcacat gcagcaacca gtccacccac aggcagctcc 2400 

caacctcaag cctggcccaa agcctccaag accctaccaa ggcttctccc caccctgctc 2460 

cccagcacng ttctccccac cccgttcccc agcacagcgc ttggggcccc tctggctcca 2520 




gaccaggccc: cccggagcag gaaaaagatc cactgacgga actcagacc: ::::ccccc: 2580

gggtccccag acagctcccc caagggagga gccgaggact zccczccczc rgcccnaagc 2640 

cccgcccccc caaggagagg caccaacctc ctcccccact gacacttccc aaccaagaaa 2700

acctccttcc cattccctca ccagctgggc acccctatag ccgcutaaac accttccaaa 2760 

- ccagctgca ctcctagcca gggaagguga agggangcac agaggngggg gaggggcacc 2820

gtgcagggta cccagcatcc ctgaccacca ggcgccaatg accagcggac gtggtcnggg 2880 

gcacacagga ggcacctcgg ataagctrgga gtictacccct ggatccaatg ccatccagag 2940 

cccagagcag gnacggggcg ccacggatca gtcattr.atc caggccgatg atccagaccc 3000

cggccagaac cactaaaaga ccactggugg atcactaggg tcactaacga gaacaccggu 3060 

caaggntact: cacgagtcac tgggcctggg ccgaaatcac cagtggaacc ctgattanga 3120

ccataaaatg ggaagttggt caaaatcaca gatggctggc ggggcacggc ggcccacacc 3180 

tgtagtcctza gcacttgggg aggccgaaga gggcagatcc cttgaaccca ggagttcaaa 3240 

accagcctgg ataacanggc aaaaccccat ctctacaaaa tagttcgctg cgtgtggngg 3300 

tgcacgcaug tggttccagc cactcaggag gctgaggcag gaggancact tgagcctggg 3360 

aggtctaggc tgcagtrgagc cgggacgatg ccactgcact ccagcctggg caacagagtg 3420 

agaccctgtc ccagcactct gggaggcaga ggagcccagt tggagatcag cctgggtaat 3480 

atagtgaaac ttgatctcta caaaaaaaag aagaaaaaaa aaagccgcgt gtggtggtgc 3540 

gcacctgtag tcccagctac tgggaagctg aggtgggagg atcacttaag cccaggaggc 3600 

agaggtcaca atgagccgaa atztgtgccaa ctgcacncca gcctgggcaa cagaggaaga 3660

ccctccacag aaaaaaaaaa aaaaaaaaag ctgctaagtc atntaccata agtcactgag 3720

aacagggga.c gtctgaccag acgcaagtgc cgctggacca ggcgggcugc tgtat.cgrgg 3780

grcagagcga gcagccggct cccgcggacg gaacccnata cgcagccaga gatgcgacag 3840

ccaecgtgga cagcccgcca ctcatcacag gcgaccrgac tccacggcct gcctctgcac 3900

gttcacaggc rcctgaccnc caaacccaag tcaagggcct ctcgccagga gctacccgcc 3960

acctgaccgt gtgcccccct: acccccatca caagatgcct gaccaccacc atgtgggcgg 4020 

cctgacactc aacccaccag gtgcngccac ccncataata agggacttga ccctcaacgc. 4080 

t cagggcc cc tgaccccaaa gtcggcatcc ccgaactctc ccaagaagct ccaggttctc 4140 

cattgtctcc aacctcctct gcctccccca aagcctccat tctcagtaag aaactcgtgg 4200 

aggggctgtc cgctctggcg gtggacgtta agttcggagg ggccgccgtc ttccccaacc 4260 

aggagcaggc ccgggagctg gcaaagacgc tggtgagcgg tgtggccttt ccctgggcaa 4320

gcgtcttgat gcgggcccag cctacccttc acccctcccg tccccactgc ctccctccac 4380

tcagcagtcc tgcctaaccc cagtcccacc ctcttctgcc cgaagtccct ccctccttca 4440 

cggcttccta acctgctgtg actttagagg tcaaggctgg cccggcctgg acctggggaa 4500

gccctcngtg gggttcctgc cccagaccaa gtacaagttc ctcctggccc cacggcgagg 4560 

tgtcgcactt cactcgtgtc tcttccccac cccaatcctt ccctgacttc atgctggggg 4620 

gctggcaacc caccctgcag caggggccgg agctcgacca agaaccggct gcagaaggcc 4680 

ccgccatggg gggtccacgc tgagccucct ccccgcaggt: tggcgcggga gccagcctag 4740

ggcttcgggt cgcggcagcg ctgaccgcca tggacaagcc cctnggtcgc tgcgtgggcc 4800

acgccctgga ggtggaggag gcgctgcrct gcatggacgg cgcaggcccg ccagactcaa 4860 

gggacccggt caccacgctc ggtgaggggg acggggtgta ggggagcgga ggcggcgggg 4920

ggtgcttccc gctggggccg ccccgacccg gccgcgccta agacccgtcc ccgcccgcag 4980

ggggcgccct gctctggctc agcggacacg cggggactca ggcncagggc gctgcccggg 5040

tggccgcggc gctggacgac ggctcggccc ttggccgctt cgagcggatg ctggcggcgc 5100 

agggcgtgga tcccggtctg acccgagccc tgtgctcggg aagtcccgca gaacgccggc 5160 

agctgctgcc tcgcgcccgg gagcaggagg agctgctggc gcccgcagat ggtgagcgtc 5220 

gggggagtcc ccgtccttcc gcctccgcca tccccttccc ttcccgangc cccgcccctt 5280 

cccgagnccg cgcctctcag cccctctccc cgcaggcacc gtggagctgg tccgggcgct 5340 

gccgctggcg ctggtgctgc acgagctcgg ggccgggcgc agccgcgctg gggagccgct 5400 

ccgcctgggg gtgggcgcag agctgctggt cgacgtgggn cagaggctgc gccgtggtga 5460 

gcgccgcccc cgccctgctg gccncgcacc cccgcccagc tccggccgcg cggcctctaa 5520

cagcccctcg ctctgcaggg accccctggc tccgcgtgca ccgggacggc cccgcgcuca 5580 

gcggcccgca gagccgcgcc ctgcaggagg cgctcgtact ctccgaccgc ncgccattcg 5640

ccgccccctn gcccttcgca gagctcgtcc tgccgccgca gcaataaagc tcctttgccg 5700 

cgaaaccttg tcagtgcttg ggcgggagcg ganggatcca gggctgcgga ggcgggggcc 5760 

gtctcgatga acacgtgacc cccggcgggc tccgccctcc gcgcacgcgc tgagagcctg 5820 

tcagcggctg cgcccgtgtg cgcatgc 5847






<210> 12
<211> 2158
<212> DNA

<213> Homo sapiens

<220>



<221> misc_feature
<222> 802, 1900 
<223> n = c or n 

<221> misc_feature

<222> 1747 

<223> n = t or g 

<400> 12 

gcgcggcata acgacccagg tcgcggcgcg gcggggcttg agcgcgtggc cggtgccgca 60 

ggagccgagc auggagtacc aggacgccgt gcgcatgccc aataccccgc agaccaatgc 120 

cggccacctg gagcaggtga agcgccagcg gggtgaccct cagacacagt tggaagccat: 180 

ggaactgtac ctggcacgga gtgggctgca ggtggaggac ctggaccggc -gaacatcat 240 

ccacgtcact gggacgaagg ggaagggcnc cacctgtgcc ttcacggaac gcatcctccg 300

aagccatggc ctgaagacgg gactcttztiag ccctccccac ccggcgcagg c-cgggagcg 360

gatccgcatc aacgggcagc ccatcagccc cgagcrcctc accaagcacc ~ ccggcgccc 420 

ctaccaccgg ctggaggaga ccaaggatgg cagctgcgcc tccacgcccc cccacctccg 480 

cntcctgaca ctcatggcct tccacgcctt cctccaagag aaggtggacc tggcagtggt 540 

ggaggtgggc attggcgggg cttatgactg caccaacatc atcaggaagc ccgtggcgug 600 

cggagtctcc tctcttggca tcgaccacac cagcctcccg ggggatacgg cggagaagac 660 

cgcatggcag aaagggggca tcttjtaagca aggtgtccct gccttcactg tgctccaacc 720 

tgaaggtccc ctggcagtgc tgagggaccg agcccagcag atctcatgtc ctctatacct 780 

gtgtccgatg ctggaggccc tngaggaagg ggggccgccg ctgaccctgg gcctggaggg 840 

ggagcaccag cggtccaacg ccgccttggc cttgcagctg gcccactgct ggctgcagcg 900 

gcaggaccgc catggtgctg gggagccaaa ggcatccagg ccagggcccc tgtggcagct 960 

gcccctggca cctgtgttcc agcccacatc ccacatgcgg ctcgggcttc ggaacacgga 1020 

gtggccgggc cggacgcagg tgctgcggcg cgggcccctc acctggtacc tggacggtgc 1080 

gcacaccgcc agcagcgcgc aggcctgcgt gcgctggttc cgccaggcgc ngcagggccg 1140 

cgagaggccg agcggtggcc ccgaggttcg agtcttgctc ttcaatgcta ccggggaccg 1200

ggacccggcg gccctgctga agctgctgca gccctgccag tttgactatg ccgtcttctg 1260

ccctaacctg acagaggtgt cauccacagg caacgcagac caacagaacc tcacagcgac 1320 

actggaccag gtcctgctcc gcngcctgga acaccagcag caccggaacc acctggacga 1380

agagcaggcc agcccggacc tcrggagtgc ccccagccca gagcccggtg ggrccgcatc 1440

cctgcttctg gcgccccacc caccccacac ctgcagtgcc agctccctcg tcttcagctg 1500 

catttcacat gccttgcaat ggatcagcca aggccgagac cccatcttcc agccacctag 1560 

tcccccaaag ggcctcctca cccaccctgt ggctcacagt ggggccagca tactccgtga 1620 

ggctgctgcc atccatgtgc tagtcactgg cagcctgcac ctggtgggtg gtgtcctgaa 1680 

gctgctggag cccgcactgt cccagtagcc aaggcccggg gttggaggcg ggagcttccc 1740 

acacctncct gcgttctccc catgaactta catactaggt gccttttgtt tttggctttc 1800 

ctggttctgt ctagactggc ctaggggcca gggctttggg atgggaggcc gggagaggat 1860

gtctttttta aggctctgtg ccttggtctc tccttcctcn tggctgagat agcagagggg 1920

ctccccgggt ctctcactgt tgcagtggcc tggccgttca gcctgtctcc cccaacaccc 1980

cgcctgcctc ctggctcagg cccagcttat tgtgtgcgct gcctggccag gccctgggtc 2040

ttgccatgtg ctgggtggta gatttcctcc tcccagtgcc ttctgggaag ggagagggcc 2100

tctgcctggg acactgcggg acagagggtg gctggagcga attaaagcct ttgttttt 2158

<210> 13 
<211> 2630 
<212> DNA 

<213> Homo sapiens 



<220>

<221> misc_feature

<222> 1424

<223> n = c or a

<221> misc_feature
<222> 1649, 2554 
<223> n = a or g 

<400> 13 

ctgattggta tgggacxgtt ggagcccaca gaacgcgcaa gaccagcctg ggngaggagg 60 

ctgtcttagt tgagaccaac gcggcgaaca gggtgagcca ggcgcagagg cctggagana 120

gaagacgggg aggactgggg ggcracagac agcccggggg gatggggcac caggaacaaa 180 

ccgagggaca caggagagat gaggcacgga ggccagtagc atcagtccct gcagggtggg 240

ggaaggccag gacgctcggg aagggagtcc cgatgacccc agctgtcccg gcagctctcc 300

ccacctggtg caggttcggg agcggatccg catcaatggg cagcccatca gccctgagcc 360 

cttcaccaag tacttccggc gcctctacca ccggctggag gagaccaagg tgccgcatgc 420

agg^gggctg gcgggtgggt atgg.ttgggg gtgctacgtg ttccagcacc ccatctcccc 480 

agagaagggg ctgcatggct ctgggccccg acatgtccct gtgccacagg atggcagctg 540

tgtctccatg cccccctact tccgcctccc gacactzcacg gccttccacg tcttcctcca 600

agagaaggng tgtgcccrct: ccccagaacc ccgcauccga ggccttggga acgggaaccc 660 

cagcaggcct gggggcrccc tgcttccatg cggcccctzgg gcaccctcac atcccctgcc 720 

atgccctctg gtctttgaca ggcggacccg gcagcggngg aggtgggcat tggcggggcr 780 

tatgactgca ccaacaccat caggngagcg cagttgcttg ggacgagggg tggcagccag 840 

gagcacagcc tcacctgcgc ctggtggctc agggcaggcc tcacggcctt ttcctccccc 900 
gcaggaagcc tgtggtgtgc ggagnccccc ctcttggcan cgaccacacc agcctcctgg 960

gggatacggt ggagaagatc gcacggcaga aagggggcat ctttaaggtg accaggcaga 1020

ctgggggaag gg^g^ga-cat ggaaggcctg ggagtcxacg ttttcatcct ggcttcactg 1080 

tgtgactgga acaagttgag tctcctcccc agactatttc cccattgaaa cgtgagggac 1140 

ggctgggcat ggtggcttat atgcttgcaa tcccagcatt tcaggaggtc gaggtgagag 1200 

gatcacctga gatccggagc ttgagaccag cctgaccaat atggggaaac tctgtctcta 1260 

ctaaaaatac aaaaattagc caggtgtggc ggtgtacgcc tgtagttcca gctacttggg 1320 

agactgaggc aggagaatca ctcgaacccg ggaggcagac gttgcagtga gccgagattg 1380

cgccacagca ctccagcctg ggcgacagag tgagacttca tctngaaaaa gaaaagaaaa 1440 

gaaacatgag ggatgagaga cagtggtagc ccagacccag ggatgtgggg gccagagata 1500 

ggagtgtgga ggatgcnagg cagccctttc tctctccttc ttccctccac agcaaggtgt 1560 

ccctgccttc actgtgcccc aaccxgaagg tccccxggca gtgctgaggg accgagccca 1620

gcagatctca gtaagtctga ttggaacgng gcagcggcag ggtgggnttg tgcccctccc 1680 

gcttgaggag gcactgcatc ccctggggcc tcagtttgcc catctgtgca gtgaggacgc 1740 

tgggccagct gccaggcctg ctggaacaca tctcagttct gggagcaggg cttggtggcr 1800

99999^9999 agagatgcaa gggctgacgt ggtcagggag ggcctctgct gacccgc'tcc 1860 

tgcctgtctc ccctagtgtc ctctatacct gtgtccgatg ctggaggccc tcgaggaagg 1920 

ggggccgccg ctgaccctgg gcctggaggg ggagcaccag cggtccaacg ccgccttggc 1980 

cttgcagctg gcccactgct ggctgcagcg gcaggaccgc catggtgagt gggcagctga 2040

gtgggcaggc aggtgggtgg cacctgtgga gcctgcctag gagggtcccg gacacacttg 2100 

gtctcacaca ccccgcaggt gctggggagc caaaggcatc caggccaggg ctcctgtggc 2160 

agctgcccct ggcacctgtg ttccagccca catcccacat gcggctcggt gagttagacc 2220

ttcctgccca gctgggacca ctgcgtgtgt ctgtgcccct tcagattttt tttttttttt 2280 

ttttggnttt ctgtttggga gataagagac aatttgaagc ggtgcttaag agaaaggact 2340 

ctgatgtcag caaacctccc tgaccttgag ctcatgaact ctttctgagc ctgtcttctc 2400 

atctgccaaa gtagatgatg ataggagcca ctgccacggg ctgtggtggg gattcgccga 2460 

ggtgacatca ctaaggtgct gagtgcagag cctggccaat gtgggataaa gtgccagcca 2520 

gtggtagctg ctgtcactgt cactatcatc atcntcagac cctgaggttc tggaggatgg 2580 

tgatccagtc atctgcxtct tgcctccccc aaagctttca gcacccagca 2630 



<210> 14 
<211> 2912 




<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<222> 263, 1037, 1139, 1955, 1017, 1037, 1139, 1309
<223> n = a or g

<221> misc_feature

<222> 266

<223> n = g or t

<221> misc_feature

<222> 527

<223> n = c or g

<221> misc_feature
<222> 1217, 1647, 2282
<223> n = c or t



<400> 14 

ggccctgcgt ccagtccctc gactaccctr acgcagccac caaaccatat acacgcatat 60

gtatagagaa agtttcaacg accaaaaaca aggaaaccaa gaaagaacnt ctctatctgc 120 

catggggcca gggtcggggc accccagcag tgtgtgaaga gcagaagncc agccaatgac 180 

agactcttcc caaaacatca cttgcttatt tcgaaatcaa acaatttctc ataaatattt 240 

tctcccaatg ctgggaagag ggnganggga aggaggcacg gaaaccccat caatcatttg 300

aagggctgcc ttttatcaga ctgajttttcc gtagtgggtt gtttgcagct tcctcctccc 360

cagttctggg cctcagctgt caaaaggatt tcaccacgca act'ttttcat- gctsagcagtt 420 

ggggccaaga agctaataga tgggaaaaag ctctgaaaac cccaggacga caaataggcg 480 

tcctcctcac agaaaaggat tactgcccca ccatccccag gtggccntca aatccgttct 540 

ctaaacggca gcagctgttt agaggtgtcc accaggtgtc cgcagctttg tcatcctatc 600 

cctgttcggg gcagagactg agggctgctg acccggaccg gctattttgg gacgtgctgc 660 

ggggggcctt gggaggttgg tgacgaaagg agtgcgtgcc cgctaaggga ggggacgccc 720 

cggagcgtac actcataaac ctggtcccga ggcctgcccc tcaccaggat ggtgcacgcg 780 

gaaggggcgg ctttttagtg gcgcaagggg gctggtcggt ggtagtttgg ggcggtgcng 840 

attgatggcg ggcggggcgg ggcggtgctg autggcgggg ggggcggggt gaggcgacgc 900 

tgcgccgatt ggctgggggc ggggcggggc gnctcccgcc cgggccraga gcgctgccgg 960 

^99cgccggg actatgtcgc gggcgcggag ccacctgcgc gccgccctat ccctggcagc 1020

ggcgtcugcg cgcggcntaa cgacccaggt cgcggcgcgg cggggccnga gcgcgtggcc 1080

ggtgccgcag gagccgagca tggagtacca ggtatcaggc gggccagcgg gccagcggnc 1140

ctgggcgcga cgacacgtgg gcctgcgctg agccgcagaa catccgggct ccgctagccg 1200 

agagggtatc gggagcnctg gactggggga ctcggggggc ggaacatcct ggaggctggg 1260 

^9tggggaca gggaccagga agttgggccc gggccgccgg ggctgggaat tcggagacta 1320 

tagcgtcccc gccccgggtt gggaagtggg aagtggcaca ggagctagga tccagaagcc 1380

cagaggctca gcggtgcttc tggagttcca gtgatcccgg agtctgaacc ggcagtgaga 1440 

gtggggaaag agggtaggga agagactcag gaattcaggc ttgaaagatc caggagtatt 1500 

gatctggggg tgggctgtcc aggattcaga agattgggga tccaagtgcc tggatttggg 1560 

ggagaggcag gaatcagggg tagtggaggg ccccagaacc tggaaaatag a.aaatgtccg 1620 

cgggcgctgt gtcaagagcc ggttgcncta gaccagaccc tgatgccagt gaggcgggcg 1680 

gcactggttt gatgagggtg gagcctccaa ccagccttga ggtcctgagg gcgggaggca 1740

cggaatatga ggcctaaggg gaatgaaata gcacccccac tcccacctcc actgtgaacc 1800 

ctcctgaagc cgtacctacc tgccttcctg gctgagtgac ccctggcaca cccctcctcc 1860 

ctctgagttg ctcctctgtg ggttggaatg tggaacccca gagtcatgag ggttggggcg 1920

gagcttcggg gaactccaga attcgaatac cccanccctc tgtagttctg gccccgctct 1980

ggcagggagc aatatagcaa tggaccccat tggaganaat gagggcaaag gcccagnagc 2040 

gaagtcgggg gagcctgggc aggaagcaag gctagcccgt tagtcatgcc accttctttg 2100 

tgtagcactc cctgggtggg gccgaactgc cccagacccc catttttgcc agagctggaa 2160 
agacgccaca czczctgzzg ctraacccr.c aggctaggcc aacagz "zg gcatggcagg 2220

cgggcccggt: actggccttg ctgccccggc ttggccac-g gccrgczggc cgcctctgcg 2280

-ntgtggacc ccgagtgagc cctaacctnc catccgggca ccgrggr rgc caggacgccg 2340

cgcgcatgct caataccctg cagaccaatg ccggcizaccr ggagcaggcg aagcgccagc 2400

ggggtgaccc tcagacacag ttggaagcca cggaaccgra cccggcacgg agtgggccgc 2460

aggnaaggca gagagggcct gtgaccacct: cccaccccca cztgtgaccc ccgtagctga 2520

ggcagggacc ttgtctgtct gccccaggtg gaggacccgg accggccgaa catcacccac 2580

gtcaccggga cgaaggggaa ggcgaggggc aggacccngg ggnagggggt: ctat naagtg 2640

gccggtggag tagagcctgc ccagacaatc ccctttcccc caagggcccc acctgtgcct 2700

ccacggaacg tatcctccga agctacggcc cgaagacggg atzctzzagg cactggcttg 2760

^ggggggacg tggtgtctgt gtcccaatgg accctggggg gccacggaac cagccagtgc 2820

ttcaggacca gggtcacccc caggaggtca gctgcatgnc tctctgccca gngtttattc 2880

actcaataaa cattcagtta gcacttacca za 2912



<210> 15 
<211> 2196 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic construct

<221> misc_feature

<222> 1784

<223> n = a or g

<221> misc_feature

<222> 464

<223> n = g or t

<221> misc_feature

<222> 120, 519, 668, 1059, 1308

<223> n = c or t

<221> misc_feature
<222> 1289
<223> n = c or a

<400> 15 

aattccggag ccatggtgaa cgaagccaga ggaaacagca gcctcaaccc ctgcttggag 60 

ggcagcgcca gcagtggcag tgagagctcc aaagatagtt cgagatgttc caccccgggn 120 

ctggaccctg agcggcatga gagactccgg gagaagatga ggcggcgatt ggaatctggt 180 

gacaagtggt tctccctgga attcttccct cctcgaactg ctgagggagc tgtcaatctc 240 

atctcaaggt ttgaccggat ggcagcaggt ggccccctct acatagacgt gacctggcac 300 

ccagcaggtg accctggctc agacaaggag acctcctcca tgatgatcgc cagcaccgcc 360 

gtgaactact gtggcctgga gaccatcctg cacatgacct gctgccgtca gcgcctggag 420 

gagatcacgg gccatctgca caaagctaag cagctgggcc tgangaacat catggcgctg 480 

cggggagacc caataggtga ccagtgggaa gaggaggang gaggcttcaa ctacgcagtg 540 

gacctggtga agcacatccg aagtgagttt ggtgactact ttgacatctg tgtggcaggt 600

tzaccccaaag gccaccccga agcagggagc tttgaggctg acctgaagca cttgaaggag 660 

aaggtgtntg cgggagccga tttcatcatc. acgcagcttt tctttgaggc tgacacattc 720

ttccgctttg tgaaggcatg caccgacatg ggcatcactt gccccatcgt ccccgggatc 780 

tzttcccatcc agggctacca ctcccttcgg cagcttgtga agctgtccaa gctggaggtg 840 

ccacaggaga tcaaggacgt gattgagcca atcaaagaca acgatgctgc catccgcaac 900 

tatggcatcg agctggccgt gagcctgtgc caggagcttc tggccagtgg cttggtgcca 960 

ggcctccact tctacaccct caaccgcgag atggctacca cagaggtgct gaagcgcctg 1020 

gggatgtgga ctgaggaccc caggcgtccc ctaccctgng ctctcagtgc ccaecccaag 1080 




cgccgagagg aagacgzacg :c:caccr"c cgggccccca gaccaaagag ::acacc:a: 1140 

-gnacccagg agngggacga gctccctaac ggccgctggg gcaacncccc zzzccczgac 1200 

tttggggagc tgaaggacxa ccaccccntc nacctgaaga gcaagzcccc caaggaggag 1260 

crgctgaaga tgtgggggga ggagcnganc agrgaagcaa gtgtcttnga agtccccgrc 1320

cut cacccct cgggagaacc aaaccggaac ggncacaaag tgacttgcct gccctggaac 1380

gatgagcccc cggcggctga gaccagcctg ctgaaggagg agccgctgcg ggcgaaccgc 1440

cagggcaccc ucaccaucaa ctcacagccc aacatcaacg ggaagccgcc ::ccgacccc 1500 

atcgugggcc ggggccccag cgggggctat gtcttccaga aggcccactt agagcctttc 1560 

acxtcccgcg agacagcgga agcacttctg caagtgccga 'agaagtacga gctccgggnt 1620 

aactaccacc ttgtcaatgt gaagggtgaa aacatcacca atgcccctga actgcagccg 1680 

aatgcxgtca cttggggcat cttccctggg cgagagatca tc.cagcccac cgtagtggat 1740 

cccgt eager tcatgttctg gaaggacgag gcctttgccc tgtngattga gcggtgggga 1800 

aagctgtatg aggaggagtc cccgtcccgc accatcatcc agtacatcca cgacaactac 1860 

ttcctggtca acctggtgga caatgacttc ccactggaca actgcctctg gcaggcggng 1920 

gaagacacat tggagcttct caacaggccc acccagaatg cgagagaaac ggaggctcca 1980 

tgacccngcg tcctgacgcc ctgcgttgga gccactcctg tcccgccttc ctcctccaca 2040

gtgctgcttc tcttgggaac tccactctcc ttcgtgtctc tcccaccccg gcctccactc 2100 

ccccacctga caatggcagc tagactggag tgaggcttcc aggctcttcc tggacctgag 2160

tcggccccac atgggaacct agtaccctct gctcta 2196 



<210> 16
<211> 1137 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature
<222> 575, 648
<223> n = t or c

<221> misc_feature

<222> 771

<223> n = g or c

<221> misc_feature

<222> 383 

<223> n = g or a 

<221> misc_feature 
<222> 942 

<223> nucleotide at position 942 is c, or absent 

<221> misc_feature 
<222> 1052 
<223> n = a or c 



<400> 16 

gaattcaaac catggtttac taaactccaa agctggagcc cttctacagt ctcaggatct 60

agaacaggga ttattactat ctctgctgtt gacatgagga aactgtggtt cagggaggtc 120 

aagtgacctg ccaaagcttg tacacatgga aagtagtaga accaggatgc aaacacattt 180

ctttaccacc aacaccaata tctattttgc caacaaaaca atgagggggc ctgagtaaat 240 

aatctcaacg gttaactcca ccctccaatt gagatacttt tttttttttt ttttttttga 300

gacagggtct ggctctctgt cacccaggct ggaatgcagc ggtgccctca gcttcccaag 360

tagctaggac tacaggccac atgccaccat gcccagctaa tttttgtatt ttttgtagaa 420

acagggtttt gccatattgc caaggctgtt ctcaaactcc tgggctcaag cagtcctcct 480 

gcctcagcct cctaaagtaa gagaagttgg aaggaaaatg ggtgaaaata aagaagttct 540

cagttatact gcagcttgtt catgcctcct gcctngggac gccgcagtgg ctgccccagc 600 




zczgccczzz zagcczca.gc crrgaaggaga gaaaaagnga :::aacaaag 660

ugaggacrgt cagccctngg accntggacc z ntgagauct cacgacccac ccctcagcgz 720

gtccaccagt gagagtggcu rzcaagggag agcgcgaagc acacgcggca nzgtcrcaca 780

ccacacctgc tgagtccaaa ccacgggagg ccccicccct agaccctgca t-cctgaaagc 840

tgcgtacccg agagctgcgg cccggctgca gggacacacc canggggagg agccgcaaec 900

gtgtctgggg ccccagccag gccggccgga gcncctgttt cncgctgctc tgccgcccge 960

ccggggtacc aacatggccc agaagcgncc ngccngcacc ctgaagccrg agcgcgccca 1020

gcagctgccg gtctgctccc aggaggccaa gnagtcagcc zaczgccccz acagccaccc 1080

ccctgtgggg gccgccccgc ccacccagga ggggagaacc utcaaaggta aaggtgg 1137



